NEW ESTIMATORS OF THE PICKANDS DEPENDENCE
FUNCTION AND A TEST FOR EXTREME-VALUE DEPENDENCE

By Axel Bücher, Holger Dette and Stanislav Volgushev

Ruhr-Universität Bochum

We propose a new class of estimators for Pickands dependence
function which is based on the concept of minimum distance estimation.
An explicit integral representation of the function $A^*(t)$, which
minimizes a weighted $L^2$-distance between the logarithm of the
copula $C(y^{1-t}, y^t)$ and functions of the form $A(t)\log(y)$, is
derived. If the unknown copula is an extreme-value copula, the function
$A^*(t)$ coincides with Pickands dependence function. Moreover, even if
this is not the case, the function $A^*(t)$ always satisfies the boundary
conditions of a Pickands dependence function. The estimators are obtained
by replacing the unknown copula by its empirical counterpart, and weak
convergence of the corresponding process is shown. A comparison
with the commonly used estimators is performed from a theoretical
point of view and by means of a simulation study. Our asymptotic
and numerical results indicate that some of the new estimators
outperform the estimators which were recently proposed by Genest and
Segers [Ann. Statist. 37 (2009) 2990-3022]. As a by-product of our
results, we obtain a simple test for the hypothesis of an extreme-value
copula, which is consistent against all positive quadrant dependent
alternatives satisfying weak differentiability assumptions of first order.

1. Introduction. The copula provides an elegant margin-free description
of the dependence structure of a random vector. By the famous theorem
of Sklar (1959), it follows that the distribution function H of a bivariate
random variable (X, Y) can be represented in terms of the marginal
distributions F and G of X and Y, that is,

$$H(x,y) = C(F(x), G(y)),$$

where C denotes the copula, which characterizes the dependence between X
and Y. Extreme-value copulas arise naturally as the possible limits of
copulas of component-wise maxima of independent, identically distributed or
strongly mixing stationary sequences [see Deheuvels (1984) and Hsing (1989)].
These copulas provide flexible tools for modeling joint extremes in risk man- 
agement. An important application of extreme-value copulas appears in the 
modeling of data with positive dependence, and in contrast to the more pop- 
ular class of Archimedean copulas they are not symmetric [see Tawn (1988) 
or Ghoudi, Khoudraji and Rivest (1998)]. Further applications can be found 
in Coles, Heffernan and Tawn (1999) or Cebrian, Denuit and Lambert (2003) 
among others. A copula C is an extreme-value copula if and only if it has
a representation of the form

(1.1) $$C(y^{1-t}, y^t) = y^{A(t)} \qquad \forall\, y, t \in [0,1],$$

where $A\colon [0,1] \to [1/2,1]$ is a convex function satisfying
$\max\{s, 1-s\} \le A(s) \le 1$, which is called Pickands dependence
function. The representation (1.1) of the extreme-value copula C depends
only on the one-dimensional function A, and statistical inference on a
bivariate extreme-value copula C may now be reduced to inference on its
Pickands dependence function A.
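The representation (1.1) can be checked numerically for a concrete family. The following minimal sketch uses the Gumbel copula, whose Pickands dependence function is $A(t) = (t^\theta + (1-t)^\theta)^{1/\theta}$; the function names and the parameter value $\theta = 2$ are illustrative choices made here and are not part of the paper.

```python
import math

def gumbel_copula(u, v, theta):
    """Gumbel copula C(u, v) = exp(-((-log u)^theta + (-log v)^theta)^(1/theta))."""
    s = (-math.log(u)) ** theta + (-math.log(v)) ** theta
    return math.exp(-s ** (1.0 / theta))

def gumbel_pickands(t, theta):
    """Pickands dependence function of the Gumbel copula."""
    return (t ** theta + (1.0 - t) ** theta) ** (1.0 / theta)

theta = 2.0  # illustrative parameter value
max_gap = 0.0
for y in (0.05, 0.3, 0.7, 0.95):
    for t in (0.1, 0.25, 0.5, 0.9):
        lhs = gumbel_copula(y ** (1 - t), y ** t, theta)   # C(y^(1-t), y^t)
        rhs = y ** gumbel_pickands(t, theta)               # y^A(t)
        max_gap = max(max_gap, abs(lhs - rhs))
print(max_gap)  # largest deviation between both sides of (1.1)
```

The two sides agree up to floating-point rounding, and the computed values of A stay between max{t, 1 - t} and 1, as required of a Pickands dependence function.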

The problem of estimating Pickands dependence function nonparametri- 
cally has found considerable attention in the literature. Roughly speaking, 
there exist two classes of estimators. The classical nonparametric estimator
is that of Pickands (1981) [see Deheuvels (1991) for its asymptotic properties] 
and several variants have been discussed. Alternative estimators have been 
proposed and investigated in the papers by Capéraà, Fougères and Genest
(1997), Jiménez, Villa-Diharce and Flores (2001), Hall and Tajvidi (2000),
Segers (2007) and Zhang, Wells and Peng (2008), where the last-named au- 
thors also discussed the multivariate case. In most references, the estimators 
of Pickands dependence function are constructed assuming knowledge of the 
marginal distributions. Recently Genest and Segers (2009) proposed
rank-based versions of the estimators of Pickands (1981) and Capéraà,
Fougères and Genest (1997), which do not require knowledge of the marginal
distributions. In general, these estimators are neither convex nor do they
satisfy the boundary restriction $\max\{t, 1-t\} \le A(t) \le 1$, in
particular the endpoint constraints $A(0) = A(1) = 1$. However, the
estimators can be modified without changing their asymptotic properties in
such a way that these constraints are satisfied; see, for example,
Fils-Villetard, Guillou and Segers (2008).

Before the specific model of an extreme-value copula is selected, it is
necessary to check this assumption by a statistical test, that is, a test
for the hypotheses

(1.2) $$H_0\colon C \in \mathcal{C} \quad \text{vs.} \quad H_1\colon C \notin \mathcal{C},$$

where $\mathcal{C}$ denotes the class of all copulas satisfying (1.1).
Throughout this paper, we call (1.2) the hypothesis of extreme-value
dependence. The problem of testing this hypothesis has found much less
attention in the literature. To the best of our knowledge, only two tests of
extremeness are currently available. The first one was proposed by Ghoudi,
Khoudraji and Rivest (1998). It exploits the fact that for an extreme-value
copula the random variable $W = H(X,Y) = C(F(X), G(Y))$ satisfies the
identity

(1.3) $$-1 + 8E[W] - 9E[W^2] = 0.$$

The properties of this test have been studied by Ben Ghorbal, Genest and
Nešlehová (2009), who determined the finite- and large-sample variance of
the test statistic. In particular, the test proposed by Ghoudi, Khoudraji and
Rivest (1998) is not consistent against alternatives satisfying (1.3). The
second class of tests was recently introduced by Kojadinovic and Yan (2010),
who proposed to compare the empirical copula and a copula estimator which
is constructed from the estimators proposed by Genest and Segers (2009)
under the assumption of an extreme-value copula. These tests are only
consistent against alternatives that are left tail decreasing in both
arguments and satisfy strong smoothness assumptions on the copula and
convexity assumptions on an analogue of Pickands dependence function, which
are hard to verify analytically.

The present paper has two purposes. The first is the development of some
alternative estimators of Pickands dependence function using the principle
of minimum distance estimation. We propose to consider the best
approximation of the logarithm of the empirical copula $\hat C_n$ evaluated
at the point $(y^{1-t}, y^t)$, that is, $\log \hat C_n(y^{1-t}, y^t)$, by
functions of the form

(1.4) $$\log(y) A(t)$$

with respect to a weighted $L^2$-distance. It turns out that the minimal
distance and the corresponding optimal function can be determined explicitly.
On the basis of this result, and by choosing various weight functions in the
$L^2$-distance, we obtain an infinite-dimensional class of estimators for the
function A. Our approach is closely related to the theory of Z-estimation,
and in Section 3 we indicate how this point of view provides several
interesting relationships between the different concepts for constructing
estimates of Pickands dependence function.

The second purpose of the paper is to present a new test for the
hypothesis of extreme-value dependence, which is consistent against a much
broader class of alternatives than the tests which have been proposed so far.
Here our approach is based on an estimator of a weighted minimum
$L^2$-distance between the true copula and the class of functions satisfying
(1.4), and the corresponding tests are consistent with respect to all
positive quadrant dependent alternatives satisfying weak differentiability
assumptions of first order. To the best of our knowledge, this method
provides the first test in this context which is consistent against such a
general class of alternatives. Moreover, in contrast to Ghoudi, Khoudraji and
Rivest (1998) and Kojadinovic and Yan (2010), we also provide a weak
convergence result under fixed alternatives which can be used for studying
the power of the test.

The remaining part of the paper is organized as follows. In Section 2, 
we consider the approximation problem from a theoretical point of view. 
In particular, we derive explicit representations for the minimal
$L^2$-distance between the logarithm of the copula and its best
approximation by a function of the form (1.4), which will be the basis for
all statistical applications in this paper. The new estimators, say
$\hat A_n$, are defined in Section 3, where we also prove weak convergence
of the process $\{\sqrt{n}(\hat A_n(t) - A(t))\}_{t \in [0,1]}$
in the space of uniformly bounded functions on the interval [0, 1] under
appropriate assumptions on the weight function used in the $L^2$-distance.
Furthermore, we give a theoretical and empirical comparison of the new
estimators with the estimators proposed in Genest and Segers (2009). We will
also determine "optimal" estimators in the proposed class by minimizing
the asymptotic MSE with respect to the choice of the weight function used
in the $L^2$-distance. In particular, we demonstrate that some of the new
estimators have a substantially smaller asymptotic variance than the estimators
proposed by the last-named authors. We also provide a simulation study in 
order to investigate the finite sample properties of the different estimates. 
In Section 4, we introduce and investigate the new test of extreme-value 
dependence. In particular, we derive the asymptotic distribution of the test 
statistic under the null hypothesis as well as under the alternative. In order 
to approximate the critical values of the test, we introduce a multiplier boot- 
strap procedure, prove its consistency and study its finite sample properties 
by means of a simulation study. Finally, most of the technical details are 
deferred to the Appendix. 

2. A measure of extreme-value dependence. Let $\mathcal{A}$ denote the set
of all functions $A\colon [0,1] \to [1/2,1]$, and define $\Pi$ as the copula
corresponding to independent random variables, that is, $\Pi(u,v) = uv$.
Throughout this paper, we assume that the copula C satisfies $C \ge \Pi$,
which holds for any extreme-value copula because $A \le 1$ implies
$y^{A(t)} \ge y$ for $y \in [0,1]$. As pointed out by Scaillet (2005), this
property is equivalent to the concept of positive quadrant dependence,
that is,

(2.1) $$P(X \le x, Y \le y) \ge P(X \le x)\,P(Y \le y) \qquad \forall\, (x,y) \in \mathbb{R}^2.$$

For a copula with this property, we define the weighted $L^2$-distance

(2.2) $$M_h(C, A) = \int_{(0,1)^2} \{\log C(y^{1-t}, y^t) - \log(y) A(t)\}^2\, h(y)\, d(y,t),$$

where $h\colon [0,1] \to [0,\infty)$ is a continuous weight function.
The following result is essential for our approach and provides an explicit
expression for the best $L^2$-approximation of the logarithm of the copula by
the logarithm of a function of the form (1.1) and, as a by-product,
characterizes the function $A^*$ minimizing $M_h(C, A)$.

Theorem 2.1. Assume that the given copula satisfies $C \ge \Pi^\kappa$ for
some $\kappa \ge 1$ and that the weight function h satisfies
$\int_0^1 (\log y)^2 h(y)\, dy < \infty$. Then the function

$$A^* = \operatorname{argmin}\{M_h(C, A) \mid A \in \mathcal{A}\}$$

is unique and given by

(2.3) $$A^*(t) = B_h^{-1} \int_0^1 \frac{\log C(y^{1-t}, y^t)}{\log y}\, h^*(y)\, dy,$$

where the associated weight function $h^*$ is defined by

(2.4) $$h^*(y) = \log^2(y)\, h(y), \qquad y \in (0,1),$$

and

(2.5) $$B_h = \int_0^1 (\log y)^2 h(y)\, dy = \int_0^1 h^*(y)\, dy.$$

Moreover, the minimal $L^2$-distance between the logarithms of the given
copula and the class of functions of the form (1.4) is given by

(2.6) $$M_h(C, A^*) = \int_{(0,1)^2} \left(\frac{\log C(y^{1-t}, y^t)}{\log y}\right)^2 h^*(y)\, d(y,t) - B_h \int_0^1 (A^*(t))^2\, dt.$$

Proof. Since $C \ge \Pi^\kappa$, we get
$0 \ge \log C(y^{1-t}, y^t) \ge \kappa \log y$ and thus
$|\log C(y^{1-t}, y^t)| \le \kappa |\log y|$, so that all integrals exist.
Rewriting the distance in (2.2) gives

$$M_h(C, A) = \int_0^1 \int_0^1 \left(\frac{\log C(y^{1-t}, y^t)}{\log y} - A(t)\right)^2 (\log y)^2 h(y)\, dy\, dt$$

and the assertion is now obvious. □
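The step that the proof calls obvious can be spelled out as a pointwise minimization in t. The following is a sketch under the assumptions of Theorem 2.1; the shorthand $f(y,t)$ is introduced here only for readability and does not appear in the original.

```latex
% Shorthand (not in the original): f(y,t) = \log C(y^{1-t},y^t) / \log y.
% For fixed t and a constant a, expand the inner integral using (2.4), (2.5):
\begin{align*}
\int_0^1 \{f(y,t) - a\}^2 h^*(y)\,dy
  &= \int_0^1 f(y,t)^2 h^*(y)\,dy
     - 2a \int_0^1 f(y,t)\,h^*(y)\,dy + a^2 B_h .
\end{align*}
% This is a quadratic in a with leading coefficient B_h > 0, so it has the
% unique minimizer
\[
a = A^*(t) = B_h^{-1} \int_0^1 f(y,t)\,h^*(y)\,dy ,
\]
% which is (2.3). Inserting a = A^*(t) gives the minimal inner value
% \int_0^1 f(y,t)^2 h^*(y)\,dy - B_h \{A^*(t)\}^2, and integrating over t
% yields (2.6):
\[
M_h(C,A^*) = \int_{(0,1)^2} f(y,t)^2\,h^*(y)\,d(y,t)
             - B_h \int_0^1 \{A^*(t)\}^2\,dt .
\]
```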

Note that $A^*(t) = A(t)$ if C is an extreme-value copula of the form (1.1)
with Pickands dependence function A. Furthermore, the following lemma
shows that the minimizing function $A^*$ defined in (2.3) satisfies the
boundary conditions of Pickands dependence functions.

Lemma 2.2. Assume that C is a copula satisfying $C \ge \Pi$. Then the
function $A^*$ defined in (2.3) has the following properties:

(i) $A^*(0) = A^*(1) = 1$,

(ii) $A^*(t) \ge t \vee (1-t)$,

(iii) $A^*(t) \le 1$.

Proof. Assertion (i) is obvious. For a proof of (ii), one uses the
Fréchet-Hoeffding bound $C(u,v) \le u \wedge v$ [see, e.g., Nelsen (2006)]
and obtains the assertion by a direct calculation. Similarly, assertion (iii)
follows from the inequality $C \ge \Pi$. □

Unfortunately, the function $A^*$ is in general not convex for every copula
satisfying $C \ge \Pi$. A counterexample can be derived from Theorem 3.2.2 in
Nelsen (2006) and is given by the following shuffle of the copula
$u \wedge v$:

(2.7) $$C(u,v) = \begin{cases}
\min\{u, v, 1/2\}, & (u,v) \in [0, \sqrt{1/2}]^2,\\
\min\{u, v + 1/2 - \sqrt{1/2}\}, & (u,v) \in [0, \sqrt{1/2}] \times [\sqrt{1/2}, 1],\\
\min\{u + 1/2 - \sqrt{1/2}, v\}, & (u,v) \in [\sqrt{1/2}, 1] \times [0, \sqrt{1/2}],\\
\min\{u, v, u + v + 1/2 - 2\sqrt{1/2}\}, & (u,v) \in [\sqrt{1/2}, 1]^2,
\end{cases}$$

for which an easy calculation shows that the mapping
$t \mapsto -\log C((1/2)^{1-t}, (1/2)^t)$ is not convex. Consequently, one
can find a weight function h such that the corresponding best approximating
function $A^*$ is not convex.
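The "easy calculation" for the counterexample can be illustrated numerically. The sketch below takes the breakpoint of the shuffle to be $\sqrt{1/2}$, which is an assumption of this illustration, and compares the value of $g(t) = -\log C((1/2)^{1-t}, (1/2)^t)$ at $t = 1/2$ with the average of its values at $t = 1/4$ and $t = 3/4$; a convex function could not exceed that average.

```python
import math

R = math.sqrt(0.5)  # breakpoint of the shuffle, assumed to be sqrt(1/2)

def shuffle_copula(u, v):
    """Shuffle of min(u, v) with four regions split at R (illustrative reading of (2.7))."""
    if u <= R and v <= R:
        return min(u, v, 0.5)
    if u <= R:                      # here v > R
        return min(u, v + 0.5 - R)
    if v <= R:                      # here u > R
        return min(u + 0.5 - R, v)
    return min(u, v, u + v + 0.5 - 2 * R)

def g(t, y=0.5):
    """g_y(t) = -log C(y^(1-t), y^t)."""
    return -math.log(shuffle_copula(y ** (1 - t), y ** t))

# A convex function satisfies g(1/2) <= (g(1/4) + g(3/4)) / 2.
midpoint_value = g(0.5)
chord_value = 0.5 * (g(0.25) + g(0.75))
print(midpoint_value, chord_value)
```

Here the midpoint value is clearly larger than the chord value, so the mapping is not convex. The function also has uniform margins, for example `shuffle_copula(1.0, v)` returns `v`.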
With the notation

(2.8) $$f_y(t) = C(y^{1-t}, y^t),$$

the function $A^*$ is convex (for every weight function h) if and only if the
function $g_y(t) = -\log f_y(t)$ is convex for every $y \in (0,1)$. The
following lemma is now obvious.

Lemma 2.3. If the function $t \mapsto f_y(t) = C(y^{1-t}, y^t)$ is twice
differentiable and the inequality

$$[f_y'(t)]^2 \ge f_y''(t)\, f_y(t)$$

holds for every $(y,t) \in (0,1)^2$, then the best approximation $A^*$ defined
by (2.3) is convex.



It is worthwhile to mention that the function $A^*$ is convex for some
frequently considered classes of copulas, which will be illustrated in the
following examples.

Example 2.4. Consider the Clayton copula

(2.9) $$C_{\mathrm{Clayton}}(u,v;\theta) = (u^{-\theta} + v^{-\theta} - 1)^{-1/\theta}, \qquad \theta > 0.$$

Then a tedious calculation yields

$$[f_y'(t)]^2 - f_y''(t) f_y(t) \ge \theta \log^2(y)\, \{C_{\mathrm{Clayton}}(y^{1-t}, y^t;\theta)\}^{2+2\theta}\, (3y^{-\theta} - 1) \ge 0,$$

where the inequalities follow observing that
$m(t) = y^{-\theta(1-t)} + y^{-\theta t} \le m(0) = 1 + y^{-\theta}$ and
$y^{-\theta} \ge 1$. Therefore, we obtain from Lemma 2.3 that the best
approximation $A^*$ is convex and corresponds to an extreme-value copula.

Example 2.5. In the following, we discuss the weight function
$h_k(y) = -y^k/\log y$ ($k \ge 0$) with associated function
$h_k^*(y) = -y^k \log y$, which will be used later for the construction of
the new estimators of Pickands dependence function. On the one hand, this
choice is made for mathematical convenience, because it allows an explicit
calculation of the asymptotic variance in specific examples. On the other
hand, estimates constructed on the basis of this weight function turn out to
have good asymptotic and finite sample properties (see the discussion in
Section 3.7). It follows that

$$B_{h_k} = -\int_0^1 y^k \log y\, dy = (k+1)^{-2}$$

and

(2.10) $$A^*(t) = -(k+1)^2 \int_0^1 \log C(y^{1-t}, y^t)\, y^k\, dy,$$

which simplifies in the case k = 0 to the representation

(2.11) $$A^*(t) = -\int_0^1 \log C(y^{1-t}, y^t)\, dy.$$
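Formula (2.10) lends itself to a quick numerical check: for an extreme-value copula the best approximation should reproduce the Pickands dependence function for every k. The following minimal sketch uses the Gumbel copula with $\theta = 2$ and a plain midpoint rule; the function names, the grid size and the parameter values are illustrative assumptions, not part of the paper.

```python
import math

def gumbel_log_copula(u, v, theta):
    """log C(u, v) for the Gumbel copula (an extreme-value copula)."""
    return -((-math.log(u)) ** theta + (-math.log(v)) ** theta) ** (1.0 / theta)

def best_approximation(log_copula, t, k=0.0, n_grid=20000):
    """Midpoint-rule approximation of (2.10):
    A*(t) = -(k+1)^2 * int_0^1 log C(y^(1-t), y^t) y^k dy."""
    total = 0.0
    for i in range(n_grid):
        y = (i + 0.5) / n_grid          # midpoints avoid y = 0 and y = 1
        total += log_copula(y ** (1.0 - t), y ** t) * y ** k
    return -((k + 1.0) ** 2) * total / n_grid

theta = 2.0                              # illustrative parameter value
log_c = lambda u, v: gumbel_log_copula(u, v, theta)
a_true = (0.3 ** theta + 0.7 ** theta) ** (1.0 / theta)   # Gumbel A(0.3)
results = {k: best_approximation(log_c, 0.3, k) for k in (0.0, 0.5, 2.0)}
for k, a_star in results.items():
    print(k, a_star, a_true)
```

Up to the quadrature error of the midpoint rule, all three values of k return the same number, in line with the remark that $A^* = A$ for extreme-value copulas.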

Example 2.6. In the following, we calculate the minimal distance
$M_h(C, A^*)$ and its corresponding best approximation $A^*$ for two copula
families and the associated weight function $h_1^*(y) = -y \log y$ from
Example 2.5. First, we investigate the Gaussian copula defined by

$$C_\rho(u,v) = \Phi_2(\Phi^{-1}(u), \Phi^{-1}(v), \rho),$$

where $\Phi$ is the standard normal distribution function and
$\Phi_2(\cdot,\cdot,\rho)$ is the distribution function of a bivariate normal
random variable with standard normally distributed margins and correlation
$\rho \in [0,1]$. For the limiting cases $\rho = 0$ and $\rho = 1$, we obtain
the independence and perfect dependence copula, respectively, while for
$\rho \in (0,1)$ the copula $C_\rho$ is not an extreme-value copula.
The minimal distances are plotted as a function of $\rho$ in the left part of
the first line of Figure 1. In the right part, we show some functions $A^*$
corresponding to the best approximation of the logarithm of the Gaussian
copula by a function of the form (1.4). We note that all functions $A^*$ are
convex although $C_\rho$ is only an extreme-value copula in the case
$\rho = 0$.

Fig. 1. Left: minimal distances $M_h(C, A^*)$ (scaled by a power of ten) for
the Gaussian copula (as a function of its correlation coefficient) and for
the convex combination of a Gumbel and a Clayton copula (as a function of
the parameter $\alpha$ in the convex combination). Right: the functions $A^*$
corresponding to the best approximations by functions of the form (1.4).

In the second example, we consider a convex combination of a Gumbel
copula with parameter $\theta_1 = \log 2/\log 1.5$ (corresponding to a
coefficient of tail dependence of 0.5) and a Clayton copula with parameter
$\theta_2 = 2$, that is,

$$C_\alpha(u,v) = \alpha\, C_{\mathrm{Clayton}}(u,v;\theta_2) + (1-\alpha)\, C_{\mathrm{Gumbel}}(u,v;\theta_1), \qquad \alpha \in [0,1],$$

where the Clayton copula is given in (2.9) and the Gumbel copula is defined
by

$$C_{\mathrm{Gumbel}}(u,v;\theta) = \exp\bigl(-\{(-\log u)^\theta + (-\log v)^\theta\}^{1/\theta}\bigr), \qquad \theta \ge 1.$$

Note that only the Gumbel copula is an extreme-value copula and is obtained
for $\alpha = 0$. The minimal distances are depicted in the left part of the
lower panel of Figure 1 as a function of $\alpha$. In the right part, we show
the functions $A^*$ corresponding to the best approximation of the logarithm
of $C_\alpha$ by a function of the form (1.4). Again all approximations are
convex, which means that $A^*$ corresponds in fact to an extreme-value copula.
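The properties discussed in Lemma 2.2 and Example 2.4 can be illustrated for a copula that is not an extreme-value copula. The sketch below evaluates (2.11) for the Clayton copula with $\theta = 2$ on a grid of t values and inspects the boundary values, the bounds and the second differences; the grid sizes and the use of a midpoint rule are choices made for this illustration only.

```python
import math

def clayton_log_copula(u, v, theta):
    """log C(u, v) for the Clayton copula (2.9); not an extreme-value copula."""
    return -math.log(u ** (-theta) + v ** (-theta) - 1.0) / theta

def a_star(t, theta, n_grid=5000):
    """Midpoint-rule approximation of (2.11): A*(t) = -int_0^1 log C(y^(1-t), y^t) dy."""
    total = 0.0
    for i in range(n_grid):
        y = (i + 0.5) / n_grid
        total += clayton_log_copula(y ** (1.0 - t), y ** t, theta)
    return -total / n_grid

theta = 2.0
grid = [j / 20 for j in range(21)]
values = [a_star(t, theta) for t in grid]
# Discrete convexity check: second differences of a convex function are >= 0.
second_diffs = [values[j - 1] - 2 * values[j] + values[j + 1] for j in range(1, 20)]
print(values[0], values[10], values[20], min(second_diffs))
```

Up to quadrature error, the endpoint values are close to 1, all values lie between max{t, 1 - t} and 1, and the second differences are nonnegative, which is what Lemma 2.2 and Example 2.4 predict.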



3. A class of minimum distance estimators. 



3.1. Pickands and CFG estimators. Let $(X_1,Y_1), \ldots, (X_n,Y_n)$ denote
a sample of independent identically distributed bivariate random variables
with copula C and marginals F and G. Most of the estimates which have
been proposed in the literature so far are based on the fact that the random
variable

$$\xi(t) = \frac{-\log F(X)}{1-t} \wedge \frac{-\log G(Y)}{t}$$

is exponentially distributed with parameter $A(t)$. In particular, we have
$E[\xi(t)] = 1/A(t)$. If the marginal distributions were known, an estimate
of $A(t)$ could be obtained by the method of moments. In the case of unknown
marginals, Genest and Segers (2009) proposed to replace F and G by their
empirical counterparts and obtained

$$\hat A^{P}_{n,r}(t) = \left(\frac{1}{n} \sum_{i=1}^n \hat\xi_i(t)\right)^{-1}$$

as a rank-based version of Pickands estimate, where

$$\hat\xi_i(t) = \frac{-\log \hat F_n(X_i)}{1-t} \wedge \frac{-\log \hat G_n(Y_i)}{t}$$

and

(3.1) $$\hat F_n(X_i) = \frac{1}{n+1} \sum_{j=1}^n I\{X_j \le X_i\} \quad\text{and}\quad \hat G_n(Y_i) = \frac{1}{n+1} \sum_{j=1}^n I\{Y_j \le Y_i\}$$

denote the (slightly modified) empirical distribution functions of the
samples $\{X_j\}_{j=1}^n$ and $\{Y_j\}_{j=1}^n$ at the points $X_i$ and $Y_i$,
respectively. Similarly, observing the identity
$E[\log \xi(t)] = -\log A(t) - \gamma$ (here
$\gamma = -\int_0^\infty \log x\, e^{-x}\, dx$ denotes Euler's constant), they
obtained a rank-based version of the estimate proposed by Capéraà, Fougères
and Genest (1997), that is,

$$\hat A^{CFG}_{n,r}(t) = \exp\left(-\gamma - \frac{1}{n} \sum_{i=1}^n \log \hat\xi_i(t)\right).$$

For illustrative purposes, we finally recall two integral representations for
the rank-based versions of the Pickands and CFG estimates, which we use in
Section 3.6 to put all estimates considered in this paper in a general
context, that is,

(3.2) $$\frac{1}{\hat A^{P}_{n,r}(t)} = \int_0^1 \frac{\hat C_n(y^{1-t}, y^t)}{y}\, dy,$$

(3.3) $$\gamma + \log \hat A^{CFG}_{n,r}(t) = \int_0^1 \frac{\hat C_n(y^{1-t}, y^t) - I\{y > e^{-1}\}}{y \log y}\, dy,$$

where

(3.4) $$\hat C_n(u,v) = \frac{1}{n} \sum_{i=1}^n I\{\hat F_n(X_i) \le u,\ \hat G_n(Y_i) \le v\}$$

denotes the empirical copula and $\hat F_n(X_i)$, $\hat G_n(Y_i)$ are defined
in (3.1) [see Genest and Segers (2009) for more details].
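The two rank-based estimators described above can be sketched in a few lines. This is a minimal illustration, assuming no ties in the data and $0 < t < 1$; the function names are hypothetical, and the quadratic-time rank computation is chosen for readability rather than speed.

```python
import math

EULER_GAMMA = 0.5772156649015329

def pseudo_observations(sample):
    """Rescaled ranks (1/(n+1)) * #{j : x_j <= x_i}, as in (3.1)."""
    n = len(sample)
    return [sum(1 for xj in sample if xj <= xi) / (n + 1) for xi in sample]

def xi_hat(u, v, t):
    """xi_i(t) = min(-log u / (1 - t), -log v / t), defined for 0 < t < 1."""
    return min(-math.log(u) / (1.0 - t), -math.log(v) / t)

def pickands_rank(xs, ys, t):
    """Rank-based Pickands estimator: reciprocal of the mean of xi_i(t)."""
    us, vs = pseudo_observations(xs), pseudo_observations(ys)
    return len(xs) / sum(xi_hat(u, v, t) for u, v in zip(us, vs))

def cfg_rank(xs, ys, t):
    """Rank-based CFG estimator: exp(-gamma - mean of log xi_i(t))."""
    us, vs = pseudo_observations(xs), pseudo_observations(ys)
    mean_log = sum(math.log(xi_hat(u, v, t)) for u, v in zip(us, vs)) / len(xs)
    return math.exp(-EULER_GAMMA - mean_log)

# Perfectly dependent toy data: the true Pickands function is max(t, 1 - t).
xs = [float(i) for i in range(1, 501)]
ys = [2.0 * x + 1.0 for x in xs]
print(pickands_rank(xs, ys, 0.5), cfg_rank(xs, ys, 0.5))
```

For this comonotone toy sample both estimates at t = 1/2 are close to the true value 1/2; real data would of course add sampling variability.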

3.2. New estimators and weak convergence. Theorem 2.1 suggests defining
a class of new estimators for Pickands dependence function by replacing
the unknown copula in (2.3) by the empirical copula defined in (3.4).
The asymptotic properties of the corresponding estimators will be
investigated in this section. For technical reasons, we require that the
argument of the logarithm in the representation (2.3) is positive and
propose to use the estimator

(3.5) $$\tilde C_n = \hat C_n \vee n^{-\gamma},$$

where the constant $\gamma$ satisfies $\gamma > 1/2$ and the empirical copula
$\hat C_n$ is defined in (3.4).

For the subsequent proofs, we will need a result on the weak convergence
of the empirical copula process with estimated margins. While this problem
has been considered by many authors [see, e.g., Rüschendorf (1976),
Fermanian, Radulović and Wegkamp (2004) or Tsukahara (2005) among others],
all of them assume that the copula has continuous partial derivatives on
the whole unit square $[0,1]^2$. However, as was pointed out by Segers (2010),
there is only one extreme-value copula that has this property. Luckily, in
a remarkable paper Segers (2010) was able to show that the following
condition is sufficient for weak convergence of the empirical copula process:

(3.6) $$\partial_j C \text{ exists and is continuous on } \{(u_1,u_2) \in [0,1]^2 \mid u_j \in (0,1)\}$$

(j = 1, 2). This condition can be shown to hold for any extreme-value copula
with continuously differentiable Pickands function A [see Segers (2010)].
Moreover, under this assumption, the process $\sqrt{n}(\tilde C_n - C)$ shows
the same limiting behavior as the empirical copula process
$\sqrt{n}(\hat C_n - C)$, that is,

(3.7) $$\sqrt{n}(\tilde C_n - C) \rightsquigarrow G_C,$$

where the symbol $\rightsquigarrow$ denotes weak convergence in
$l^\infty[0,1]^2$. Here, $G_C$ is a Gaussian field on the square $[0,1]^2$
which admits the representation

$$G_C(x) = B_C(x) - \partial_1 C(x)\, B_C(x_1, 1) - \partial_2 C(x)\, B_C(1, x_2),$$

where $x = (x_1, x_2)$, $B_C$ is a bivariate pinned C-Brownian sheet on the
square $[0,1]^2$ with covariance kernel given by

$$\operatorname{Cov}(B_C(x), B_C(y)) = C(x \wedge y) - C(x) C(y),$$

and the minimum $x \wedge y$ is understood component-wise. Observing the
representation (2.3), we obtain the estimator

(3.8) $$\hat A_{n,h}(t) = B_h^{-1} \int_0^1 \frac{\log \tilde C_n(y^{1-t}, y^t)}{\log y}\, h^*(y)\, dy$$

for Pickands dependence function, where $\tilde C_n$ is defined in (3.5). Note
that this relation specifies an infinite-dimensional class of estimators
indexed by the set of all admissible weight functions. The following results
specify the asymptotic properties of these estimators. We begin with a
slightly more general statement, which shows weak convergence for the
weighted integrated process

$$\sqrt{n}\, W_{n,w}(t) = \sqrt{n} \int_0^1 \log \frac{\tilde C_n(y^{1-t}, y^t)}{C(y^{1-t}, y^t)}\, w(y,t)\, dy,$$

where the weight function $w\colon [0,1]^2 \to \mathbb{R}$ depends on y and t.
The result (and some arguments in its proof) are also needed in Section 4.
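The minimum distance estimator (3.8) can be sketched numerically for the weight function of Example 2.5, for which it reduces to $-(k+1)^2 \int_0^1 \log \tilde C_n(y^{1-t}, y^t)\, y^k\, dy$. The code below is a minimal illustration, not the authors' implementation: the function names, the midpoint rule, the grid size and the values `k=0.5` and `gamma=0.95` are assumptions made here.

```python
import math

def pseudo_observations(sample):
    """Rescaled ranks (1/(n+1)) * #{j : x_j <= x_i}, as in (3.1)."""
    n = len(sample)
    return [sum(1 for xj in sample if xj <= xi) / (n + 1) for xi in sample]

def empirical_copula(us, vs, u, v):
    """Empirical copula (3.4) evaluated at (u, v)."""
    return sum(1 for ui, vi in zip(us, vs) if ui <= u and vi <= v) / len(us)

def minimum_distance_estimator(xs, ys, t, k=0.5, gamma=0.95, n_grid=1000):
    """Estimator (3.8) for h_k: -(k+1)^2 * int_0^1 log(max(C_n, n^-gamma)) y^k dy."""
    us, vs = pseudo_observations(xs), pseudo_observations(ys)
    floor = len(xs) ** (-gamma)          # truncation level n^(-gamma) from (3.5)
    total = 0.0
    for i in range(n_grid):
        y = (i + 0.5) / n_grid           # midpoint rule on (0, 1)
        c_tilde = max(empirical_copula(us, vs, y ** (1.0 - t), y ** t), floor)
        total += math.log(c_tilde) * y ** k
    return -((k + 1.0) ** 2) * total / n_grid

# Perfectly dependent toy data: the true Pickands function is max(t, 1 - t).
xs = [float(i) for i in range(1, 201)]
ys = [3.0 * x for x in xs]
estimate = minimum_distance_estimator(xs, ys, 0.5)
print(estimate)
```

The truncation at $n^{-\gamma}$ only matters for very small y, where the empirical copula vanishes and its logarithm would otherwise be undefined.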

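Theorem 3.1. Assume that for the weight function
$w\colon [0,1]^2 \to \mathbb{R}$ there exists a function
$\bar w\colon [0,1] \to [0,\infty)$ such that

(3.9) $$\forall\, (y,t) \in [0,1]^2\colon \quad |w(y,t)| \le \bar w(y),$$

(3.10) $$\forall\, \varepsilon > 0\colon \quad \sup_{y \in [\varepsilon,1]} \bar w(y) < \infty,$$

(3.11) $$\int_0^1 \bar w(y)\, y^{-\lambda}\, dy < \infty$$

for some $\lambda > 1$. If the copula C satisfies (3.6) and $C \ge \Pi$, then
we have for any $\gamma \in (1/2, \lambda/2)$ as $n \to \infty$

(3.12) $$\sqrt{n}\, W_{n,w}(t) = \sqrt{n} \int_0^1 \log \frac{\tilde C_n(y^{1-t}, y^t)}{C(y^{1-t}, y^t)}\, w(y,t)\, dy \rightsquigarrow W_{C,w}(t) = \int_0^1 \frac{G_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)}\, w(y,t)\, dy$$

in $l^\infty[0,1]$.

The following result is now an immediate consequence of Theorem 3.1 using
$w(y,t) := B_h^{-1} h^*(y)/\log y$ [recall the definition of the associated
weight function $h^*$ in (2.4)] and yields the weak convergence of the process
$\sqrt{n}(\hat A_{n,h} - A^*)$ for a broad class of weight functions.

Theorem 3.2. If the copula $C \ge \Pi$ satisfies condition (3.6) and the
weight function h satisfies the conditions

(3.13) $$\text{for all } \varepsilon > 0\colon \quad \sup_{y \in [\varepsilon,1]} \left|\frac{h^*(y)}{\log y}\right| < \infty,$$

(3.14) $$\int_0^1 h^*(y)\, (-\log y)^{-1}\, y^{-\lambda}\, dy < \infty$$

for some $\lambda > 1$, then we have for any $\gamma \in (1/2, \lambda/2)$ as
$n \to \infty$

$$\mathbb{A}_{n,h} = \sqrt{n}(\hat A_{n,h} - A^*) \rightsquigarrow \mathbb{A}_{C,h} \qquad \text{in } l^\infty[0,1],$$

where the process $\mathbb{A}_{C,h}$ is given by

(3.15) $$\mathbb{A}_{C,h}(t) = B_h^{-1} \int_0^1 \frac{G_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)}\, \frac{h^*(y)}{\log y}\, dy.$$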



Remark 3.3. (a) Conditions (3.13) and (3.14) restrict the behavior of
the function $h^*$ near the boundary of the interval [0, 1]. A simple
sufficient condition for (3.13) and (3.14) is given by

$$\sup_{x \in [0,1]} \left|\frac{h^*(x)}{x^\alpha (1-x)^\beta}\right| < \infty$$

for some $\alpha > 0$, $\beta > 1$. In this case, $\lambda$ can be chosen as
$1 + \alpha/2$.

(b) In the construction discussed so far, it is also possible to use weight
functions that depend on t, that is, functions of the form $h^*(y,t)$. As long
as $h^*(y,t) > 0$ for $(y,t) \in (0,1) \times [0,1]$, the corresponding best
approximation $A^*$ will still be well defined and correspond to the Pickands
dependence function if C is an extreme-value copula. Theorem 3.1 provides the
asymptotic properties of the corresponding estimator if we set
$w(y,t) := h^*(y,t)/(-\log y)$ and assume that $\int_0^1 h^*(y,t)\, dy = 1$
for all t. However, for the sake of a clear presentation, we will only use
weight functions that do not depend on t.



Note that Theorem 3.2 is also correct if the given copula is not an
extreme-value copula. In other words, it establishes weak convergence of the
process $\sqrt{n}(\hat A_{n,h} - A^*)$ to a centered Gaussian process, where
$A^*$ denotes the function corresponding to the best approximation of the
logarithm of the copula C by a function of the form (1.4). If $A^*$ is convex,
it corresponds to an extreme-value copula and coincides with Pickands
dependence function. Note also that Theorem 3.2 excludes the case
$h_0^*(y) = -\log y$, because condition (3.14) is not satisfied for this
weight function. Nevertheless, under the additional assumption that C is an
extreme-value copula with twice continuously differentiable Pickands
dependence function A, the assertion of the preceding theorem is still valid.

Theorem 3.4. Assume that C is an extreme-value copula with twice
continuously differentiable Pickands dependence function A. For the weight
function $h_0^*(y) = -\log y$, we have for any $\gamma \in (1/2, 3/4)$ as
$n \to \infty$

$$\mathbb{A}_{n,h_0} = \sqrt{n}(\hat A_{n,h_0} - A) \rightsquigarrow \mathbb{A}_{C,h_0}$$

in $l^\infty[0,1]$, where
$\hat A_{n,h_0}(t) = -\int_0^1 \log \tilde C_n(y^{1-t}, y^t)\, dy$.

Remark 3.5. (a) If the marginals of (X, Y) are independent, the distribution
of the random variable $\mathbb{A}_{\Pi,h_0}$ coincides with the distribution
of the random variable
$\mathbb{A}^P_r = -\int_0^1 G_\Pi(y^{1-t}, y^t)\, y^{-1}\, dy$, which appears
as the weak limit of the appropriately standardized Pickands estimator; see
Genest and Segers (2009). In fact, a much more general statement is true: by
using weight functions $h^*(y,t)$ depending on t it is possible to obtain for
any extreme-value copula estimators of the form (3.8) which show the same
limiting behavior as the estimators proposed by Genest and Segers (2009).
This already indicates that for any extreme-value copula it is possible to
find weight functions which will make the new minimum distance estimators
asymptotically at least as efficient (in fact better, as will be shown in
Section 3.4) as the estimators introduced by Genest and Segers (2009).

(b) A careful inspection of the proof of Theorem 3.1 reveals that the
condition $C \ge \Pi$ can be relaxed to $C \ge \Pi^\kappa$ for some
$\kappa > 1$, if one imposes stronger conditions on the weight function.

(c) The estimator depends on the parameter $\gamma$ which is used for the
construction of the statistic $\tilde C_n = \hat C_n \vee n^{-\gamma}$. This
modification is only made for technical purposes, and from a practical point
of view the behavior of the estimators does not change substantially provided
that $\gamma$ is chosen larger than 2/3.

Remark 3.6. The new estimators can be alternatively motivated by observing
that the identity (1.1) yields the representation
$A(t) = \log C(y^{1-t}, y^t)/\log y$ for any $y \in (0,1)$. This leads to a
simple class of estimators, that is,

$$\hat A_{n,\delta_y}(t) = \frac{\log \tilde C_n(y^{1-t}, y^t)}{\log y}, \qquad y \in (0,1),$$

where $\delta_y$ is the Dirac measure at the point y and $\tilde C_n$ is
defined in (3.5). By averaging these estimators with respect to a
distribution, say $\pi$, we obtain estimators of the form

$$\hat A_{n,\pi}(t) = \int_0^1 \frac{\log \tilde C_n(y^{1-t}, y^t)}{\log y}\, \pi(dy),$$

which coincide with the estimators obtained by the concept of best
$L^2$-approximation.

3.3. A special class of weight functions. In this subsection, we illustrate
the results by investigating Example 2.5 discussed at the end of Section 2.
For the associated weight function $h_k^*(y) = -y^k \log y$ with $k \ge 0$,
we obtain

(3.16) $$\hat A_{n,h_k}(t) = -(k+1)^2 \int_0^1 \log \tilde C_n(y^{1-t}, y^t)\, y^k\, dy.$$

The process $\{\mathbb{A}_{n,h_k}(t)\}_{t \in [0,1]}$ converges weakly in
$l^\infty[0,1]$ to the process $\{\mathbb{A}_{C,h_k}(t)\}_{t \in [0,1]}$,
which is given by

(3.17) $$\mathbb{A}_{C,h_k}(t) = -(k+1)^2 \int_0^1 \frac{G_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)}\, y^k\, dy.$$

Consequently, for $C \in \mathcal{C}$, the asymptotic variance of
$\hat A_{n,h_k}$ is obtained as

(3.18) $$\operatorname{Var}(\mathbb{A}_{C,h_k}(t)) = (k+1)^4 \int_0^1 \int_0^1 \sigma(u,v;t)\, (uv)^{k - A(t)}\, du\, dv,$$

where the function $\sigma$ is given by

$$\sigma(u,v;t) = \operatorname{Cov}(G_C(u^{1-t}, u^t), G_C(v^{1-t}, v^t)).$$

In order to find an explicit expression for these variances, we assume that
the function A is differentiable and introduce the notation

$$\mu(t) = A(t) - tA'(t), \qquad \nu(t) = A(t) + (1-t)A'(t),$$

where $A'$ denotes the derivative of A. The following results can be shown
by similar arguments as given in Genest and Segers (2009); for details, see
Bücher, Dette and Volgushev (2010).

Proposition 3.7. For $t \in [0,1]$, let $\bar\mu(t) = 1 - \mu(t)$ and
$\bar\nu(t) = 1 - \nu(t)$. If C is an extreme-value copula with Pickands
dependence function A, then the variance of the random variable
$\mathbb{A}_{C,h_k}(t)$ is given by

2/i(t)/x(t)(/c + l) 2u{t)i?{t){k + l) 
2k + l + t 2k + 2 -t 
- ^'^(^^f^Ty^ / (^(^) + {k + i- A{t))-



i-t 

+ {k + l-t)j^ dsj. 

Note that the limiting process in (3.15) is a centered Gaussian process.
This means that, asymptotically, the quality of the new estimators [as well
as of the estimators of Genest and Segers (2009), which show a similar
limiting behavior] is determined by the variance. Based on these
observations, we will now provide an asymptotic comparison of the new
estimators $\hat A_{n,h_k}(t)$ with the estimators investigated by Genest and
Segers (2009). Some finite sample results will be presented in the following
section for various families of copulas. For the sake of brevity, we restrict
ourselves to the independence copula $\Pi$, for which $A(t) = 1$. In the case
k = 0, we obtain from Proposition 3.7 the same variance as for the rank-based
version of Pickands estimator, that is,

$$\operatorname{Var}(\mathbb{A}_{\Pi,h_0}(t)) = \frac{3t(1-t)}{(2-t)(1+t)} = \operatorname{Var}(\mathbb{A}^P_r(t))$$

[see Corollary 3.4 in Genest and Segers (2009)], while the case $k \ge 0$
yields

$$\operatorname{Var}(\mathbb{A}_{\Pi,h_k}(t)) = \frac{(3+4k)(k+1)^2}{2k+1}\, \frac{t(1-t)}{(2k+2-t)(2k+1+t)}.$$

Investigating the derivative in k, it is easy to see that
$\operatorname{Var}(\mathbb{A}_{\Pi,h_k}(t))$ is strictly decreasing in k with

$$\lim_{k \to \infty} \operatorname{Var}(\mathbb{A}_{\Pi,h_k}(t)) = \frac{t(1-t)}{2}.$$

Therefore, we have

$$\operatorname{Var}(\mathbb{A}^P_r) = \operatorname{Var}(\mathbb{A}_{\Pi,h_0}) \ge \operatorname{Var}(\mathbb{A}_{\Pi,h_k})$$

for all $k \ge 0$ with strict inequality for all $k > 0$. This means that for
the independence copula all estimators obtained by our approach with
associated weight function $h_k^*(y) = -y^k \log y$, $k > 0$, have a smaller
asymptotic variance than the rank-based version of Pickands estimator. On the
other hand, a comparison with the CFG estimator proposed by Genest and Segers
(2009) does not provide a clear picture about the superiority of one
estimator, and we defer this comparison to the following section, where
optimal weight functions for the new estimates $\hat A_{n,h}$ are introduced.
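The variance comparison for the independence copula can be reproduced numerically. The sketch below evaluates the closed-form variance from this section and cross-checks it for k = 1 against a midpoint-rule approximation of the double integral $(k+1)^4 \int\!\!\int \sigma(u,v;t)(uv)^{k-1}\, du\, dv$. The product form of $\sigma$ under independence and the exponent $k - 1$ (that is, $A(t) = 1$) are taken as assumptions of this illustration, and the function names are hypothetical.

```python
def variance_closed_form(t, k):
    """Asymptotic variance under independence, as stated in Section 3.3."""
    return ((3 + 4 * k) * (k + 1) ** 2 / (2 * k + 1)
            * t * (1 - t) / ((2 * k + 2 - t) * (2 * k + 1 + t)))

def sigma_independence(u, v, t):
    """Cov(G(u^(1-t), u^t), G(v^(1-t), v^t)) for the independence copula."""
    a = min(u, v) ** (1 - t) - (u * v) ** (1 - t)
    b = min(u, v) ** t - (u * v) ** t
    return a * b

def variance_numeric(t, k, n_grid=200):
    """Midpoint-rule value of (k+1)^4 * double integral of sigma * (uv)^(k-1)."""
    h = 1.0 / n_grid
    total = 0.0
    for i in range(n_grid):
        u = (i + 0.5) * h
        for j in range(n_grid):
            v = (j + 0.5) * h
            total += sigma_independence(u, v, t) * (u * v) ** (k - 1)
    return (k + 1) ** 4 * total * h * h

t = 0.5
closed = variance_closed_form(t, 1.0)
numeric = variance_numeric(t, 1.0)
by_k = [variance_closed_form(t, k) for k in (0, 0.5, 1, 2, 5, 50)]
limit = t * (1 - t) / 2
print(closed, numeric)
print(by_k, limit)
```

The list `by_k` decreases in k and stays above the limiting value t(1 - t)/2, which matches the ordering of the variances stated above.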
3.4. Optimal weight functions. In this section, we discuss asymptotically
optimal weight functions corresponding to the class of estimates introduced
in Section 3.2. As pointed out in the previous section, from an asymptotic
point of view the mean squared error of the estimates is dominated by the
variance, and therefore we concentrate on weight functions minimizing the
asymptotic variance of the estimate $\hat A_{n,h}$. The finite sample
properties of the mean squared error of the various estimates will be
investigated by means of a simulation study in Section 3.7.

Note that an optimal weight function depends on the point t where
Pickands dependence function has to be estimated and on the unknown
copula. Therefore, an estimator with an optimal weight function cannot be
implemented in concrete applications without preliminary knowledge about
the copula. However, it can serve as a benchmark for user-specified weight
functions. To be precise, observe that by Theorem 3.2 the variance of the
limiting process $\mathbb{A}_{C,h}$ is of the form

(3.19) $$V(\xi) = \int_0^1 \int_0^1 k_t(x,y)\, d\xi(x)\, d\xi(y),$$

where $\xi$ denotes a probability measure on the interval [0, 1] defined by
$d\xi(x) = B_h^{-1} h^*(x)\, dx$ and the kernel $k_t(x,y)$ is given by

$$k_t(x,y) = E\left[\frac{G_C(x^{1-t}, x^t)}{C(x^{1-t}, x^t) \log x}\, \frac{G_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t) \log y}\right].$$

It is easy to see that V defines a convex function on the space of all
probability measures on the interval [0, 1], and the existence of a
minimizing measure follows if the kernel $k_t$ is continuous on $[0,1]^2$.
The following result characterizes the minimizer of V and is proved in the
Appendix.

Theorem 3.8. A probability measure $\eta$ on the interval [0, 1] minimizes V
if and only if the inequality

(3.20) $$\int_0^1 k_t(x,y)\, d\eta(y) \ge \int_0^1 \int_0^1 k_t(x,y)\, d\eta(x)\, d\eta(y)$$

is satisfied for all $x \in [0,1]$.

Theorem 3.8 can be used to check the optimality of a given weight
function. For example, if the copula $C$ is given by the independence copula
$\Pi$ we have

$k_t(x, y) = \frac{(x^t \wedge y^t - (xy)^t)(x^{1-t} \wedge y^{1-t} - (xy)^{1-t})}{x \log x \; y \log y},$

and it is easy to see that none of the associated weight functions
$h_k^*(y) = -y^k \log y$ with $k > 0$ is optimal in the sense that it minimizes the
asymptotic variance of the estimate $\hat{A}_{n,h}$ with respect to the choice of the
weight function. On the other hand, the result is less useful for an explicit
computation of optimal weight functions. Deriving an analytical expression
for the optimal weight function seems to be impossible, even for the simple
case of the independence copula.

However, approximations to the optimal weight function can easily be
computed numerically. To be precise, we approximate the double integral
appearing in the representation of $\mathrm{Var}(\mathbb{A}_{C,h}(t))$ by the finite sum

(3.21)  $V(\xi) \approx \sum_{i=1}^N \sum_{j=1}^N \xi_{i,N} \xi_{j,N} k_t(i/N, j/N) = \Xi^T K_t \Xi,$

where $N \in \mathbb{N}$, $K_t = (k_t(i/N, j/N))_{i,j=1}^N$ denotes an $N \times N$ matrix,
$\Xi = (\xi_{i,N})_{i=1}^N$ is a vector of length $N$ and $\xi_{i,N} = \xi(((i-1)/N, i/N])$
represents the mass of $\xi$ allocated to the interval $((i-1)/N, i/N]$
($i = 1, \ldots, N$). Minimizing the right-hand side of the above equation with
respect to $\Xi$ under the constraints $\xi_{i,N} \ge 0$, $\sum_{i=1}^N \xi_{i,N} = 1$ is a
quadratic (convex) optimization problem which can be solved by standard
methods; see, for example, Nocedal and Wright (2006). Approximations of the
optimal weight function can thus be calculated with arbitrary precision by
increasing $N$.
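The discretized problem (3.21) can be illustrated with a minimal sketch for the independence copula. It is not the routine used for the results of this section: it replaces the quadratic programming solver by a plain projected gradient iteration, and it evaluates the kernel at the cell midpoints $(i - 1/2)/N$ instead of $i/N$ in order to avoid the points $x = 0$ and $x = 1$, where the kernel is only defined as a limit. The function names and the choices of N and of the number of iterations are assumptions made for illustration.

```python
import math


def kernel_independence(x, y, t):
    """Kernel k_t(x, y) of Section 3.4 for the independence copula, 0 < x, y < 1."""
    num = (min(x**t, y**t) - (x * y) ** t) * (
        min(x ** (1 - t), y ** (1 - t)) - (x * y) ** (1 - t)
    )
    return num / (x * math.log(x) * y * math.log(y))


def project_simplex(v):
    """Euclidean projection of v onto {xi : xi_i >= 0, sum_i xi_i = 1}."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for j, u_j in enumerate(u, start=1):
        cumulative += u_j
        candidate = (cumulative - 1.0) / j
        if u_j - candidate > 0:
            theta = candidate
    return [max(x - theta, 0.0) for x in v]


def quad_form(K, xi):
    """Value of the discretized variance Xi^T K Xi."""
    n = len(xi)
    return sum(xi[i] * sum(K[i][j] * xi[j] for j in range(n)) for i in range(n))


def optimal_weights(t, N=40, iters=500):
    """Approximate minimizer of Xi^T K_t Xi over the probability simplex."""
    nodes = [(i + 0.5) / N for i in range(N)]  # midpoints (assumption, see text)
    K = [[kernel_independence(x, y, t) for y in nodes] for x in nodes]
    # The gradient 2 K Xi is Lipschitz with constant at most 2 * max row sum.
    lip = 2.0 * max(sum(abs(a) for a in row) for row in K)
    xi = [1.0 / N] * N  # start from the uniform measure
    for _ in range(iters):
        grad = [2.0 * sum(K[i][j] * xi[j] for j in range(N)) for i in range(N)]
        xi = project_simplex([xi[i] - grad[i] / lip for i in range(N)])
    return nodes, K, xi
```

Because the step size is the reciprocal of an upper bound on the Lipschitz constant of the gradient, the objective does not increase from one iteration to the next, so the returned weights are never worse than the uniform starting measure. A dedicated quadratic programming solver will in general converge much faster.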

In the remaining part of this section, we will compare the asymptotic
variance of the Pickands estimator, the CFG estimator proposed by Genest and
Segers (2009) and the new estimates, where the new estimators are based on
the weight functions discussed in Section 3.3 for two values of $k$ as well as
on the optimal weights minimizing the right-hand side of (3.21), where we
set $N = 100$. In order to compute the solution $\Xi_{opt}$, we used the routine
ipop from the R-package kernlab by Karatzoglou et al. (2004). In the left
part of Figure 2, we show the asymptotic variances of the different estima- 
tors for the independence copula. We observe that Pickands estimator has 
the largest asymptotic variances (this curve is not displayed in the figure), 
while the CFG estimator of Genest and Segers (2009) yields smaller vari- 
ances than the estimator A„ /j^, but larger asymptotic variances than the 

estimators A^.h^,- On the other hand, the estimate An^hopt corresponding 
to the numerically determined optimal weight function yields a substan- 
tially smaller variance than all other estimates under consideration. In the 
right-hand part of Figure 2, we display the corresponding results for the 
asymmetric negative logistic model [see Joe (1990)]

(3.22)  $A(t) = 1 - \{(\psi_1 (1 - t))^{-\theta} + (\psi_2 t)^{-\theta}\}^{-1/\theta}$

with parameters $\psi_1 = 1$, $\psi_2 = 2/3$ and $\theta \in (0, \infty)$ chosen such that the
coefficient of tail dependence is 0.6. We observe that the estimate A^^h^ yields
the largest asymptotic variance. The CFG estimate proposed by Genest and 
Segers (2009) and the estimate A^^hx show a similar behavior (with minor 
advantages for the latter), while the best results are obtained for the new
estimate corresponding to the optimal weight function.

Fig. 2. Asymptotic variances of various estimators of the Pickands dependence function.
Left panel: independence copula; right panel: asymmetric negative logistic model.

We conclude this section with the remark that we have presented a com- 
parison of the different estimators based on the asymptotic variance which 
determines the mean squared error asymptotically. For finite samples, min- 
imizing only the variance might increase the bias and therefore the asymp- 
totic results cannot directly be transferred to applications. In the finite sam- 
ple study presented in Section 3.7, we will demonstrate that not all of the 
asymptotic results yield good predictions for the finite-sample behavior of 
the corresponding estimators. 

3.5. Convex estimates and endpoint corrections. In general, all of the 
estimates discussed so far [including those proposed by Genest and Segers 
(2009)] will neither be convex, nor will they satisfy the other characterizing 
properties of Pickands dependence functions. However, the literature pro- 
vides many proposals on how to enforce these conditions. Various endpoint 
corrections have been proposed by Deheuvels (1991), Segers (2007) or Hall 
and Tajvidi (2000) among others. Fils-Villetard, Guillou and Segers (2008) 
proposed an $L^2$-projection of the estimate of Pickands dependence function
on a space of partially linear functions which is arbitrarily close to the
space of all convex functions in $\mathcal{A}$ satisfying the conditions of Lemma 2.2.
They also showed that this transformation decreases the $L^2$-distance between
the "true" dependence function and the estimate. An alternative concept of
constructing convex estimators is based on the greatest convex minorant,
which yields a decrease in the sup-norm, that is,

$\sup_{t \in [0,1]} |\hat{A}_n^{gcm}(t) - A(t)| \le \sup_{t \in [0,1]} |\hat{A}_n(t) - A(t)|,$

where $\hat{A}_n$ is any initial estimate of Pickands dependence function and
$\hat{A}_n^{gcm}$ its greatest convex minorant [see, e.g., Marshall (1970), Wang
(1986), Robertson, Wright and Dykstra (1996) among others]. It is also
possible to combine this concept with an endpoint correction calculating the
greatest convex minorant of the function

$t \mapsto (\hat{A}_n(t) \wedge 1) \vee t \vee (1 - t)$

[see Genest and Segers (2009) who also proposed alternative special endpoint 
corrections for their estimators]. All these methods can be used to produce 
an estimate of A which has the characterizing properties of a Pickands de- 
pendence function. 
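The combination of endpoint correction and greatest convex minorant described above can be sketched on a grid as follows. This is only an illustration under the assumption that the initial estimate is available as a list of values on an increasing grid of points in [0, 1]; the function names are hypothetical, and the minorant is computed as the lower convex hull of the grid points rather than by a library routine.

```python
def endpoint_correct(t_grid, a_vals):
    """Apply t -> (A_n(t) ∧ 1) ∨ t ∨ (1 - t) pointwise on the grid."""
    return [max(min(a, 1.0), t, 1.0 - t) for t, a in zip(t_grid, a_vals)]


def greatest_convex_minorant(t_grid, vals):
    """Greatest convex minorant of the points (t_grid[i], vals[i]).

    t_grid must be strictly increasing. The minorant is the lower convex
    hull of the points, evaluated again on t_grid.
    """
    hull = []
    for p in zip(t_grid, vals):
        # Drop the last hull vertex while it lies on or above the chord
        # from the vertex before it to the new point p.
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            if (x2 - x1) * (p[1] - y1) - (y2 - y1) * (p[0] - x1) <= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    out, seg = [], 0
    for t in t_grid:
        # Move to the hull segment that contains t, then interpolate linearly.
        while seg + 1 < len(hull) - 1 and hull[seg + 1][0] < t:
            seg += 1
        (x1, y1), (x2, y2) = hull[seg], hull[seg + 1]
        out.append(y1 + (y2 - y1) * (t - x1) / (x2 - x1))
    return out
```

Since $t \mapsto t \vee (1 - t)$ is itself convex, the minorant of the endpoint-corrected function still lies above this lower bound, below 1, and equals 1 at both endpoints.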

3.6. M- and Z-estimates. As mentioned in the Introduction, a broader
class of estimates could be obtained by minimizing more general distances
between the given copula and the class of functions defined by (1.1), and in
this paragraph we briefly indicate this principle. Consider the best
approximation of the copula $C$ by functions of the form (1.1) with respect to
the distance

(3.23)  $D_\Phi(C, A) = \int_0^1 \int_0^1 \Phi(C(y^{1-t}, y^t), y^{A(t)}) \, w(y, t) \, dy \, dt,$

where $\Phi : [0, 1] \times [0, 1] \to \mathbb{R}_0^+$ denotes a "distance" and $w$ is a given
weight function. Note that the minimization in (3.23) can be carried out by
separately minimizing the inner integral for every value of $t$. Consequently,
the problem reduces to a one-dimensional minimization problem and, assuming
differentiability, it follows that for fixed $t$ the optimal value $A^*(t)$
minimizing the interior integral in (3.23) is obtained as a solution of the
equation

$\frac{\partial}{\partial a} \int_0^1 \Phi(C(y^{1-t}, y^t), y^a) \, w(y, t) \, dy \, \Big|_{a = A^*(t)} = 0.$

Under suitable assumptions, integration and differentiation can be exchanged
and we have

(3.24)  $\int_0^1 \partial_2 \Phi(C(y^{1-t}, y^t), y^a) \, (\log y) \, y^a \, w(y, t) \, dy \, \Big|_{a = A^*(t)} = 0,$

where $\partial_2 \Phi$ denotes the derivative of $\Phi$ with respect to the second
argument. In general, the solution of (3.24) is only defined implicitly as a
functional of the copula $C$. Therefore, if $C$ is replaced by the empirical
copula the analysis of the stochastic properties of the corresponding process
turns out to be extremely difficult because in many cases one has to control
improper integrals (see the proofs of Theorems 3.1 and 3.4 in the Appendix).
For the sake of a clear exposition, we do not discuss details in this paper
and defer these considerations to future research.

Nevertheless, equation (3.24) yields a different view on the estimation
problem of Pickands dependence function. Note that the estimate introduced
in Section 3.2 is obtained by the choice $w(y, t) = h(y) B_h^{-1}$ and

$\Phi(z_1, z_2) = (\log z_1 - \log z_2)^2; \quad \partial_2 \Phi(z_1, z_2) = -2(\log z_1 - \log z_2)/z_2$

in (3.24). This estimate corresponds to a minimum distance estimate.
Similarly, an estimate corresponding to the classical $L^2$-distance is
obtained for the choice

$\Phi(z_1, z_2) = (z_1 - z_2)^2; \quad \partial_2 \Phi(z_1, z_2) = -2(z_1 - z_2).$

This yields for (3.24) the equation

$\int_0^1 (C(y^{1-t}, y^t) - y^a) \, (\log y) \, y^a \, w(y, t) \, dy \, \Big|_{a = A^*(t)} = 0,$

which cannot be solved analytically. The rank-based versions of Pickands and
the CFG estimator proposed by Genest and Segers (2009) do not correspond
to M-estimates, but could be considered as Z-estimates obtained from (3.24)
for the function

$\partial_2 \Phi(z_1, z_2) = (z_1 - z_2)/z_2$

with $w_{\mu,\nu}(y) = y^{\mu-1}/(-\log y)^{\nu+1}$ with $\mu = \nu = 0$ and $\mu = 0$, $\nu = 1$,
respectively. In fact, this choice leads to a general class of estimators
which relates the Pickands and the CFG estimate in an interesting way. To be
precise, note that for $\nu \in [0, 1)$ equation (3.24) yields

(3.25)  $\int_0^1 (C(y^{1-t}, y^t) - I\{y > e^{-1}\}) \frac{y^{\mu-1}}{(-\log y)^\nu} \, dy = \int_0^1 (x^{A^*(t)} - I\{x > e^{-1}\}) \frac{x^{\mu-1}}{(-\log x)^\nu} \, dx.$

Here the case $\nu = 1$ has to be interpreted as the limit $\nu \to 1$, which yields
a generalization of the defining equation for the CFG estimate, that is,

$\log(A^*(t) + \mu) = \log \mu + \int_\mu^\infty e^{-s} \, \frac{ds}{s} + \int_0^1 (C(y^{1-t}, y^t) - I\{y > e^{-1}\}) \frac{y^{\mu-1}}{\log y} \, dy.$

Observing the relation

$\lim_{\mu \to 0} \left( \log \mu + \int_\mu^\infty e^{-s} \, \frac{ds}{s} \right) = -\gamma,$

we obtain the defining equation for the estimate proposed by Genest and
Segers (2009) [see (3.3)]. Similarly, if $\nu \in [0, 1)$ it follows from (3.25)
that

(3.26)  $\Gamma(1 - \nu) \, (A^*(t) + \mu)^{\nu - 1} = \int_0^1 C(y^{1-t}, y^t) \frac{y^{\mu-1}}{(-\log y)^\nu} \, dy,$

and we obtain a defining equation for a generalization of the Pickands
estimate. The classical case is obtained for $\mu = \nu = 0$ [see Genest and Segers
(2009) or equation (3.2)], but (3.26) defines many other estimates of this
type. Therefore, the Pickands and the CFG estimate correspond to the
extreme cases in the class $\{w_{\mu,\nu} \mid \mu \ge 0, \nu \in [0, 1]\}$.

We finally note that there are numerous other functions $\Phi$ which could
be used for the construction of alternative Z-estimates, but most of them do
not lead to an explicit solution for $A^*(t)$. In this sense the CFG estimator,
the Pickands estimator and the estimates proposed in this paper could be
considered as attractive special cases, which can be explicitly represented in
terms of an integral of the empirical copula.
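To illustrate why the minimum distance choice leads to an explicit solution, the following worked step inserts $\Phi(z_1, z_2) = (\log z_1 - \log z_2)^2$ and $w(y, t) = h(y) B_h^{-1}$ into (3.24). The last line assumes the relations $h^*(y) = (\log y)^2 h(y)$ and $B_h = \int_0^1 h^*(y) \, dy$, which are taken here as notation from Section 2 and should be checked against the definitions given there.

```latex
% Second argument z_2 = y^a, hence \log z_2 = a \log y and
% \partial_2 \Phi(C, y^a) = -2 (\log C(y^{1-t}, y^t) - a \log y) / y^a.
\begin{align*}
0 &= \int_0^1 \partial_2 \Phi\bigl(C(y^{1-t}, y^t), y^a\bigr) (\log y) \, y^a \, h(y) B_h^{-1} \, dy \\
  &= -2 B_h^{-1} \int_0^1 \bigl(\log C(y^{1-t}, y^t) - a \log y\bigr) (\log y) \, h(y) \, dy .
\end{align*}
% The equation is linear in a, so it can be solved explicitly:
\begin{align*}
A^*(t) = a
  &= \frac{\int_0^1 \log C(y^{1-t}, y^t) \, (\log y) \, h(y) \, dy}{\int_0^1 (\log y)^2 h(y) \, dy} \\
  &= B_h^{-1} \int_0^1 \frac{\log C(y^{1-t}, y^t)}{\log y} \, h^*(y) \, dy
  \qquad \text{(assuming } h^* = (\log \cdot)^2 h,\ B_h = \textstyle\int_0^1 h^*\text{)} .
\end{align*}
```

For the $L^2$ choice $\Phi(z_1, z_2) = (z_1 - z_2)^2$ the unknown $a$ enters through $y^a$ and $y^{2a}$, so the corresponding equation is no longer linear in $a$.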

3.7. Finite sample properties. In this subsection, we investigate the small 
sample properties of the new estimators by means of a simulation study. 
Especially, we compare the new estimators with the rank-based estimators 
suggested by Genest and Segers (2009), which are most similar in spirit with 
the method proposed in this paper. We study the finite sample behavior 
of the greatest convex minorants of the endpoint corrected versions of the 
various estimators. The new estimators are corrected in a first step by 



corrections proposed in Genest and Segers (2009), then applied (3.27) and 
finally calculated the greatest convex minorant. Hereby, we compare the 
performance of the different statistical procedures which will be used in 
concrete applications and apply the corrections, that are most favorable 
for the respective estimators. The greatest convex minorants are computed 
using the routine gcmlcm from the package fdrtool by Strimmer (2009). All 
results presented here are based on 5,000 simulation runs and the sample 
size is n = 100. 

As estimators, we consider the statistics defined in (3.8) with the weight
function $h_k$ and the optimal weight function determined in Section 3.4. An
important question is the choice of the parameter $k$ for the statistic
$\hat{A}_{n,h_k}$, in order to achieve a balance between bias and variance. For this
purpose, we first study the performance of the estimator $\hat{A}_{n,h_k}$ with
respect to different choices for the parameter $k$ and consider the asymmetric
negative logistic model defined in (3.22) and the symmetric mixed model [see
Tawn (1988)] defined by

(3.28)  $A(t) = 1 - \theta t + \theta t^2, \quad \theta \in [0, 1].$

The results for other copula models are similar and are omitted for the
sake of brevity. For the Pickands dependence function (3.22), we used the
parameters $\psi_1 = 1$ and $\psi_2 = 2/3$ such that the coefficient of tail dependence
is given by $\rho = 2(3^\theta + 2^\theta)^{-1/\theta}$ and varies in the interval (0, 2/3), while the
parameter $\theta \in [0, 1]$ used in (3.28) yields $\rho = \theta/2 \in [0, 1/2]$.

The quality of an estimator $\hat{A}$ is measured with respect to the mean
integrated squared error

$\mathrm{MISE}(\hat{A}) = E\left[ \int_0^1 (\hat{A}(t) - A(t))^2 \, dt \right],$

which was computed by taking the average over 5,000 simulated samples.
The new estimators turned out to be rather robust with respect to the
choice of the parameter $\gamma$ in the definition of the process
$\tilde{C}_n = C_n \vee n^{-\gamma}$, provided that $\gamma > 2/3$. For this reason, we use
$\gamma = 0.95$ throughout this section. To analyze the impact of choosing
different values for $k$, in Figure 3 we display simulated curves

(3.29)  $k \mapsto \frac{\mathrm{MISE}(\hat{A}_{n,h_k})}{\min_{\ell > 0} \mathrm{MISE}(\hat{A}_{n,h_\ell})}$

for the asymmetric negative logistic and the mixed models with different
coefficients of tail dependence $\rho$, as well as the maximum over such curves
for different values of $\rho$ (solid curves), that is,

(3.30)  $k \mapsto \max_\rho \frac{\mathrm{MISE}_\rho(\hat{A}_{n,h_k})}{\min_{\ell > 0} \mathrm{MISE}_\rho(\hat{A}_{n,h_\ell})},$

where by $\mathrm{MISE}_\rho$ we denote the MISE for the tail dependence coefficient $\rho$.
The curves in (3.29) attain their minima in the optimal $k$ for the
respective $\rho$, and their shapes provide information about the performance of
the estimators for nonoptimal values of $k$. The solid curve gives an
impression about the "worst case" scenario (with respect to $\rho$) in every
model. The simulations indicate that for $n = 100$ the optimal values of $k$ for
different models and tail dependence coefficients lie in the interval
[0.2, 0.6]. Moreover, for values of $k$ in this interval the quality of the
estimators remains very stable. For $n = 200$, $n = 500$ and additional models
the picture remains quite similar and these results are not depicted for the
sake of brevity. We thus recommend using $k = 0.4$ in practical applications.
Note that the asymptotic analysis in Section 3.4 suggests that the
asymptotically optimal $k$ should differ substantially for various models.
However, this effect is not visible for sample sizes up to $n = 500$. In these
cases, the optimal values for $k$ usually vary in the interval [0.2, 0.8].

Fig. 3. The function defined in (3.29) for various models and coefficients of tail
dependence. The minimum corresponds to the optimal value of $k$ in the weight function $h_k$.
The solid curve corresponds to the worst case defined by (3.30). The sample size is $n = 100$
and the MISE is calculated by 5,000 simulation runs. Left panel: asymmetric negative logistic
model. Right panel: mixed model.
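For readers who want to experiment with the recommended choice $k = 0.4$ and $\gamma = 0.95$, the following minimal sketch evaluates the estimator with weight function $h_k$ at a single point $t$. It assumes the explicit form $\hat{A}_{n,h_k}(t) = -(k+1)^2 \int_0^1 \log \tilde{C}_n(y^{1-t}, y^t) \, y^k \, dy$, which is obtained here from $h_k^*(y) = -y^k \log y$ and $B_h = (k+1)^{-2}$ and should be checked against (3.8). The pseudo-observations (ranks divided by $n + 1$), the midpoint rule and all function names are assumptions of this sketch; no endpoint or convexity correction is applied.

```python
import math


def ranks(values):
    """Ranks 1..n of the entries of values (no ties assumed)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0] * len(values)
    for pos, idx in enumerate(order, start=1):
        r[idx] = pos
    return r


def md_pickands(x, y, t, k=0.4, gamma=0.95, grid=400):
    """Minimum distance estimate of A(t) with weight h_k (uncorrected)."""
    n = len(x)
    u = [r / (n + 1) for r in ranks(x)]  # pseudo-observations (assumption)
    v = [r / (n + 1) for r in ranks(y)]
    floor = n ** (-gamma)  # truncation level of C_n ∨ n^(-gamma)
    total = 0.0
    for j in range(grid):
        s = (j + 0.5) / grid  # midpoint rule on (0, 1)
        a, b = s ** (1 - t), s ** t
        c_n = sum(1 for ui, vi in zip(u, v) if ui <= a and vi <= b) / n
        total += -math.log(max(c_n, floor)) * s ** k
    return (k + 1) ** 2 * total / grid
```

For perfectly dependent data the value at $t = 0.5$ is close to 0.5, and for data that are spread evenly over the unit square it is close to 1, in line with the boundary cases $A(t) = t \vee (1 - t)$ and $A(t) = 1$.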

Next, we compare the new estimators with rank-based versions of Pickands
and the CFG estimator proposed by Genest and Segers (2009). In Figure 4,
the normalized MISE is plotted as a function of the tail dependence
parameter $\rho$ for the asymmetric negative logistic and the mixed model, where
the parameter $\theta$ is chosen in such a way that the coefficient of tail
dependence $\rho = 2(1 - A(0.5))$ varies over the specific range of the
corresponding model. For each sample, we computed the rank-based versions of
Pickands estimator, the CFG estimator [see Genest and Segers (2009)] and two
of the new estimators $\hat{A}_{n,h_k}$ ($k = 0.4$ and $k = 0.6$). In this comparison, we
also include the estimator $\hat{A}_{n,h_{opt}}$ which uses the optimal weight function
determined in Section 3.4.

Summarizing the results, one can conclude that in general the best
performance is obtained for our new estimators based on the weight function
$h_k$ with $k = 0.4$ and $k = 0.6$, in particular if the coefficient of tail
dependence is small. A comparison of the two estimators $\hat{A}_{n,h_{0.4}}$ and
$\hat{A}_{n,h_{0.6}}$ shows that the choice $k = 0.4$ performs slightly better than the
choice $k = 0.6$ in both models. In both settings, the MISE obtained by
$\hat{A}_{n,h_{0.4}}$ and $\hat{A}_{n,h_{0.6}}$ is smaller than the MISE of the CFG estimator
proposed in Genest and Segers (2009) if the coefficient of tail dependence is
small. On the other hand, the latter estimator yields slightly better results
for a large coefficient of tail dependence. The results for the rank-based
version of the Pickands estimator are not depicted, because this estimator
yields a uniformly larger MISE. Simulations of other scenarios show similar
results and are also not displayed for the sake of brevity. It is remarkable
that the optimal weight function usually yields an estimator with a
substantially larger MISE than all other estimates if the coefficient of tail
dependence is small. Similar results can be observed for the sample size
$n = 500$ (these results are not depicted). This indicates that the advantages
of the asymptotically optimal weight function only start to play a role for
rather large sample sizes.

Fig. 4. 100 x MISE for various estimators, models and coefficients of tail dependence,
based on 5,000 samples of size $n = 100$. Panels: mixed model (amix) and asymmetric
negative logistic model (aneglog). Curves: CFG, AsyOpt, L2.k0.4, L2.k0.6. Horizontal
axis: 2(1 - A(0.5)).

4. A test for an extreme-value dependence. 

4.1. The test statistic and its weak convergence. From the definition of
the functional $M_h(C, A)$ in (2.2) it is easy to see that, for a strictly
positive weight function $h$ with $h^* \in L^1(0, 1)$, a copula function $C$ is an
extreme-value copula if and only if

$\min_{A \in \mathcal{A}} M_h(C, A) = M_h(C, A^*) = 0,$

where $A^*$ denotes the best approximation defined in (2.3). This suggests
using $M_h(\tilde{C}_n, \hat{A}_{n,h})$ as a test statistic for the hypothesis (1.2), that is,

$H_0$ : $C$ is an extreme-value copula.
Recalling the representation (2.6)

$M_h(C, A^*) = \int_0^1 \int_0^1 \bar{C}^2(y, t) \frac{h^*(y)}{(\log y)^2} \, dy \, dt - B_h \int_0^1 (A^*(t))^2 \, dt$

with $\bar{C}(y, t) = -\log C(y^{1-t}, y^t)$ and defining $\bar{C}_n(y, t) := -\log \tilde{C}_n(y^{1-t}, y^t)$
we obtain the decomposition

(4.1)
$M_h(\tilde{C}_n, \hat{A}_{n,h}) - M_h(C, A^*)$

$= \int_0^1 \int_0^1 (\bar{C}_n^2(y, t) - \bar{C}^2(y, t)) \frac{h^*(y)}{(\log y)^2} \, dy \, dt - B_h \int_0^1 (\hat{A}_{n,h}^2(t) - (A^*(t))^2) \, dt$

$= 2 \int_0^1 \int_0^1 (\bar{C}_n(y, t) - \bar{C}(y, t)) \, \bar{C}(y, t) \frac{h^*(y)}{(\log y)^2} \, dy \, dt - 2 B_h \int_0^1 (\hat{A}_{n,h}(t) - A^*(t)) A^*(t) \, dt$
$\quad + \int_0^1 \int_0^1 (\bar{C}_n(y, t) - \bar{C}(y, t))^2 \frac{h^*(y)}{(\log y)^2} \, dy \, dt - B_h \int_0^1 (\hat{A}_{n,h}(t) - A^*(t))^2 \, dt$

$= 2 \int_0^1 \int_0^1 (\bar{C}_n(y, t) - \bar{C}(y, t)) (\bar{C}(y, t) - A^*(t)(-\log y)) \frac{h^*(y)}{(\log y)^2} \, dy \, dt$
$\quad + \int_0^1 \int_0^1 (\bar{C}_n(y, t) - \bar{C}(y, t))^2 \frac{h^*(y)}{(\log y)^2} \, dy \, dt$
$\quad - B_h \int_0^1 (\hat{A}_{n,h}(t) - A^*(t))^2 \, dt$

$=: S_1 + S_2 + S_3,$

where the last identity defines the terms $S_1$, $S_2$ and $S_3$ in an obvious
manner. Note that under the null hypothesis of extreme-value dependence we
have $A^* = A$ and thus $\bar{C}(y, t) = A^*(t)(-\log y)$. This means that under $H_0$
the term $S_1$ will vanish and the asymptotic distribution will be determined
by the large sample properties of the random variable $S_2 + S_3$. Under the
alternative, the equality $\bar{C}(y, t) = A^*(t)(-\log y)$ will not hold anymore and
it turns out that in this case the statistic is asymptotically dominated by the
random variable $S_1$. With the following results, we will derive the limiting
distribution of the proposed test statistic under the null hypothesis and the
alternative.
Theorem 4.1. Assume that the given copula $C$ satisfies condition (3.6)
and is an extreme-value copula with Pickands dependence function $A^*$. If
the function $w(y) := h^*(y)/(\log y)^2$ fulfills conditions (3.10) and (3.11) for
some $\lambda > 2$ and the weight function $h$ is strictly positive and satisfies
assumptions (3.13), (3.14) for $\tilde{\lambda} := \lambda/2 > 1$, then we have for any
$\gamma \in (1/2, \lambda/4)$ and $n \to \infty$

$n M_h(\tilde{C}_n, \hat{A}_{n,h}) \rightsquigarrow Z_0,$

where the random variable $Z_0$ is defined by

$Z_0 := \int_0^1 \int_0^1 \left( \frac{\mathbb{G}_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)} \right)^2 \frac{h^*(y)}{(\log y)^2} \, dy \, dt - B_h \int_0^1 \mathbb{A}_{C,h}^2(t) \, dt$

with $B_h = \int_0^1 h^*(y) \, dy$ and the process $\{\mathbb{A}_{C,h}(t)\}_{t \in [0,1]}$ defined in
Theorem 3.2.

The next theorem gives the distribution of the test statistic
$M_h(\tilde{C}_n, \hat{A}_{n,h})$ under the alternative. Note that in this case we have
$M_h(C, A^*) > 0$.

Theorem 4.2. Assume that the given copula $C$ satisfies $C \ge \Pi$,
condition (3.6) and that $M_h(C, A^*) > 0$. If additionally the weight function $h$
is strictly positive and $h$ and the function $w(y) := h^*(y)/(\log y)^2$ satisfy the
assumptions (3.13), (3.14) and (3.10), (3.11) for some $\lambda > 1$, respectively,
then we have for any $\gamma \in (1/2, (1 + \lambda)/4 \wedge \lambda/2)$ and $n \to \infty$

$\sqrt{n} (M_h(\tilde{C}_n, \hat{A}_{n,h}) - M_h(C, A^*)) \rightsquigarrow Z_1,$

where the random variable $Z_1$ is defined as

$Z_1 = 2 \int_0^1 \int_0^1 \frac{\mathbb{G}_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)} \, \nu(y, t) \, dy \, dt$

with

$\nu(y, t) = (\log C(y^{1-t}, y^t) - \log(y) A^*(t)) \frac{h^*(y)}{(\log y)^2}.$

Remark 4.3. (a) Note that the weight functions $h_k^*(y) = -y^k \log y$
satisfy the assumptions of Theorems 4.1 and 4.2 for $k > 1$ and $k > 0$,
respectively.

(b) The preceding two theorems yield a consistent asymptotic level $\alpha$
test for the hypothesis of extreme-value dependence by rejecting the null
hypothesis $H_0$ if

(4.2)  $n M_h(\tilde{C}_n, \hat{A}_{n,h}) > z_{1-\alpha},$

where $z_{1-\alpha}$ denotes the $(1 - \alpha)$-quantile of the distribution of the random
variable $Z_0$.

(c) By Theorem 4.2, the power of the test (4.2) is approximately given by

$P(n M_h(\tilde{C}_n, \hat{A}_{n,h}) > z_{1-\alpha}) \approx 1 - \Phi\left( \frac{z_{1-\alpha}}{\sqrt{n} \, \sigma} - \sqrt{n} \, \frac{M_h(C, A^*)}{\sigma} \right),$

where the function $A^*$ is defined in (2.3) corresponding to the best
approximation of the logarithm of the copula $C$ by a function of the form
(1.4), $\sigma$ is the standard deviation of the distribution of the random variable
$Z_1$ and $\Phi$ is the standard normal distribution function. Thus, the power of
the test (4.2) is an increasing function of the quantity $M_h(C, A^*) \sigma^{-1}$.
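The normal approximation of Remark 4.3(c) is easy to evaluate once values for the critical value, the minimal distance and the standard deviation are available. The sketch below only codes the displayed formula; the function and argument names are hypothetical, and the numbers passed to it in any application would have to be supplied by the user (they are not estimated here).

```python
import math


def normal_cdf(x):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def approx_power(n, m_dist, sigma, z_crit):
    """Approximate power 1 - Phi(z / (sqrt(n) sigma) - sqrt(n) M / sigma).

    n      : sample size
    m_dist : minimal distance M_h(C, A*) under the alternative
    sigma  : standard deviation of the limit Z_1
    z_crit : critical value z_{1-alpha} of the test (4.2)
    """
    arg = z_crit / (math.sqrt(n) * sigma) - math.sqrt(n) * m_dist / sigma
    return 1.0 - normal_cdf(arg)
```

For fixed $z_{1-\alpha} > 0$ and $M_h(C, A^*) > 0$ the argument of $\Phi$ decreases in $n$ and in the ratio $M_h(C, A^*)/\sigma$, so the approximate power increases in both.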

4.2. Multiplier bootstrap. In general, the distribution of the random
variable $Z_0$ cannot be determined explicitly, because of its complicated
dependence on the (unknown) copula $C$. We hence propose to determine the
quantiles by the multiplier bootstrap approach as described in Bücher and
Dette (2010). To be precise, let $\xi_1, \ldots, \xi_n$ denote independent identically
distributed random variables with

$P(\xi_1 = 0) = P(\xi_1 = 2) = 1/2.$

We define $\bar{\xi}_n = n^{-1} \sum_{i=1}^n \xi_i$ as the mean of $\xi_1, \ldots, \xi_n$ and consider the
multiplier statistics

$C_n^*(u, v) = F_n^*(F_{n1}^-(u), F_{n2}^-(v)),$

where

$F_n^*(x_1, x_2) = \frac{1}{n} \sum_{i=1}^n \frac{\xi_i}{\bar{\xi}_n} I\{X_i \le x_1, Y_i \le x_2\}$

and $F_{nj}$ denotes the marginal empirical distribution functions. If we
estimate the partial derivatives of the copula $C$ by

$\widehat{\partial_1 C}(u, v) := \frac{C_n(u + h, v) - C_n(u - h, v)}{2h},$

$\widehat{\partial_2 C}(u, v) := \frac{C_n(u, v + h) - C_n(u, v - h)}{2h},$

where $h = n^{-1/2} \to 0$, we can approximate the distribution of $\mathbb{G}_C$ by the
distribution of the process

(4.3)  $\alpha_n^{pdm}(u, v) := \beta_n(u, v) - \widehat{\partial_1 C}(u, v) \beta_n(u, 1) - \widehat{\partial_2 C}(u, v) \beta_n(1, v),$

where $\beta_n(u, v) = \sqrt{n} (C_n^*(u, v) - C_n(u, v))$. More precisely, it was shown by
Bücher and Dette (2010) that we have weak convergence conditional on the
data in probability toward $\mathbb{G}_C$, that is,

(4.4)  $\alpha_n^{pdm} \rightsquigarrow^P_\xi \mathbb{G}_C$ in $l^\infty[0, 1]^2,$

where the symbol $\rightsquigarrow^P_\xi$ denotes weak convergence conditional on the data in
probability as defined by Kosorok (2008), that is, $\alpha_n^{pdm} \rightsquigarrow^P_\xi \mathbb{G}_C$ if

(4.5)  $\sup_{h \in BL_1(l^\infty[0,1]^2)} |E_\xi h(\alpha_n^{pdm}) - E h(\mathbb{G}_C)| \to^P 0$

and

(4.6)  $E_\xi h(\alpha_n^{pdm})^* - E_\xi h(\alpha_n^{pdm})_* \to^P 0$ for every $h \in BL_1(l^\infty[0, 1]^2).$

Here

$BL_1(l^\infty[0, 1]^2) = \{f : l^\infty[0, 1]^2 \to \mathbb{R} : \|f\|_\infty \le 1, \; |f(\beta) - f(\gamma)| \le \|\beta - \gamma\|_\infty \; \forall \gamma, \beta \in l^\infty[0, 1]^2\}$

is the class of all uniformly bounded functions which are Lipschitz
continuous with constant at most one, and $E_\xi$ denotes the conditional
expectation with respect to the weights $\xi_n$ given the data
$(X_1, Y_1), \ldots, (X_n, Y_n)$. As a consequence, we obtain the following bootstrap
approximation for $Z_0$.

Theorem 4.4. If condition (3.6) is satisfied, the weight function $h$
satisfies the conditions of Theorem 4.1 and $h^*(y)(y \log y)^{-2}$ is uniformly
bounded, then

$Z_0^* = \int_0^1 \int_0^1 \left( \frac{\alpha_n^{pdm}(y^{1-t}, y^t)}{\tilde{C}_n(y^{1-t}, y^t)} \right)^2 \frac{h^*(y)}{(\log y)^2} \, dy \, dt - B_h^{-1} \int_0^1 \left( \int_0^1 \frac{\alpha_n^{pdm}(y^{1-t}, y^t)}{\tilde{C}_n(y^{1-t}, y^t)} \frac{h^*(y)}{\log y} \, dy \right)^2 dt$

converges weakly to $Z_0$ conditional on the data, that is,

$Z_0^* \rightsquigarrow^P_\xi Z_0$ in $l^\infty[0, 1].$

By Theorem 4.4, $Z_0^*$ is a valid bootstrap approximation for the
distribution of $Z_0$. Consequently, repeating the procedure $B$ times yields a
sample $Z_0^*(1), \ldots, Z_0^*(B)$ that is approximately distributed according to $Z_0$
and we can use the empirical $(1 - \alpha)$-quantile of this sample, say $z_{1-\alpha}^*$, as an
approximation for $z_{1-\alpha}$. Therefore, rejecting the null hypothesis if

(4.7)  $n M_h(\tilde{C}_n, \hat{A}_{n,h}) > z_{1-\alpha}^*$

yields a consistent asymptotic level $\alpha$ test for extreme-value dependence.

Note that the condition on the boundedness of the function $h^*(y)(y \log y)^{-2}$
is not satisfied for any member of the class $h_k(y) = -y^k/\log(y)$ from
Example 2.5. Nevertheless, mimicking the procedure from Kojadinovic and Yan
(2010) and using $h_k(y) I_{[\varepsilon, 1-\varepsilon]}(y)$ instead of $h_k(y)$ is sufficient for the
boundedness. Since this is the procedure usually performed in practical
applications, Theorem 4.4 is still valuable for the weight functions
investigated in this paper.
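One draw of the multiplier process (4.3) can be sketched as follows. The sketch works with pseudo-observations in [0, 1] (ranks divided by $n$) instead of the generalized inverses $F_{nj}^-$, and it shortens the difference quotient at the boundary of the unit square so that the empirical copula is never evaluated outside [0, 1]; both simplifications, and the function name, are assumptions of this illustration.

```python
import math
import random


def multiplier_process(u, v, xi):
    """Return the function (a, b) -> alpha_n^{pdm}(a, b) for one weight draw.

    u, v : pseudo-observations in [0, 1] (assumption: ranks divided by n)
    xi   : multipliers xi_1, ..., xi_n, e.g. i.i.d. with P(0) = P(2) = 1/2
    """
    n = len(u)
    xi_bar = sum(xi) / n
    if xi_bar == 0:
        raise ValueError("all multipliers are zero; draw the weights again")
    w = [x / xi_bar for x in xi]
    h = n ** -0.5  # bandwidth of the difference quotients

    def c_n(a, b, weights=None):
        ws = weights if weights is not None else [1.0] * n
        return sum(wi for ui, vi, wi in zip(u, v, ws) if ui <= a and vi <= b) / n

    def beta(a, b):
        # beta_n = sqrt(n) (C_n^* - C_n)
        return math.sqrt(n) * (c_n(a, b, w) - c_n(a, b))

    def d1(a, b):
        lo, hi = max(a - h, 0.0), min(a + h, 1.0)  # clipped at the boundary
        return (c_n(hi, b) - c_n(lo, b)) / (hi - lo)

    def d2(a, b):
        lo, hi = max(b - h, 0.0), min(b + h, 1.0)
        return (c_n(a, hi) - c_n(a, lo)) / (hi - lo)

    def alpha(a, b):
        return beta(a, b) - d1(a, b) * beta(a, 1.0) - d2(a, b) * beta(1.0, b)

    return alpha
```

Repeating the draw of the multipliers $B$ times and inserting each realization into the expression for $Z_0^*$ of Theorem 4.4 (with the integrals replaced by sums over a grid) gives the bootstrap sample whose empirical $(1 - \alpha)$-quantile serves as critical value in (4.7).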



4.3. Finite sample properties. In this subsection, we investigate the
finite sample properties of the test for extreme-value dependence. We
consider the asymmetric negative logistic model (3.22), the symmetric mixed
model (3.28) and additionally the symmetric model of Gumbel

(4.8)  $A(t) = (t^\theta + (1 - t)^\theta)^{1/\theta}$

with parameter $\theta \in [1, \infty)$ [see Gumbel (1960)] and the model of Hüsler and
Reiss

(4.9)  $A(t) = (1 - t) \Phi\left( \theta + \frac{1}{2\theta} \log \frac{1 - t}{t} \right) + t \Phi\left( \theta + \frac{1}{2\theta} \log \frac{t}{1 - t} \right),$

where $\theta \in (0, \infty)$ and $\Phi$ is the standard normal distribution function [see
Hüsler and Reiss (1989)]. The coefficient of tail dependence in (4.9) is given
by $\rho = 2(1 - \Phi(\theta))$, that is, independence is obtained for $\theta \to \infty$ and
complete dependence for $\theta \to 0$. For the Gumbel model (4.8), complete
dependence is obtained in the limit as $\theta$ approaches infinity while
independence corresponds to $\theta = 1$. The coefficient of tail dependence
$\rho = 2(1 - A(0.5))$ is given by $\rho = 2 - 2^{1/\theta}$.
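The relation between the model parameter and the coefficient of tail dependence $\rho = 2(1 - A(0.5))$ can be checked numerically. The following sketch codes the Pickands dependence functions of the Gumbel model (4.8), the mixed model (3.28) and the asymmetric negative logistic model (3.22) with $\psi_1 = 1$, $\psi_2 = 2/3$; the function names are chosen for illustration only.

```python
def gumbel_A(t, theta):
    """Pickands dependence function (4.8), theta >= 1."""
    return (t**theta + (1 - t) ** theta) ** (1 / theta)


def mixed_A(t, theta):
    """Pickands dependence function (3.28), 0 <= theta <= 1."""
    return 1 - theta * t + theta * t**2


def aneglog_A(t, theta, psi1=1.0, psi2=2.0 / 3.0):
    """Pickands dependence function (3.22), theta > 0, evaluated for 0 < t < 1."""
    return 1 - ((psi1 * (1 - t)) ** (-theta) + (psi2 * t) ** (-theta)) ** (-1 / theta)


def tail_dependence(A, theta):
    """Coefficient of tail dependence rho = 2 (1 - A(0.5))."""
    return 2 * (1 - A(0.5, theta))
```

For example, `tail_dependence(gumbel_A, theta)` reproduces $2 - 2^{1/\theta}$, `tail_dependence(mixed_A, theta)` reproduces $\theta/2$, and `tail_dependence(aneglog_A, theta)` reproduces $2(3^\theta + 2^\theta)^{-1/\theta}$.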

We generated 1,000 random samples of sample size $n = 200$ from various
copula models and calculated the probability of rejecting the null
hypothesis. Under the null hypothesis, we chose the model parameters in such
a way that the coefficient of tail dependence $\rho$ varies over the specific
range of the corresponding model. Under the alternative, the coefficient of
tail dependence does not need to exist and we therefore chose the model
parameters such that Kendall's $\tau$ is an element of the set {1/4, 1/2, 3/4}.
The weight function is chosen as $h_{0.4}(y) = -y^{0.4}/\log(y)$ and the critical
values are determined by the multiplier bootstrap approach as described in
Section 4.2 with $B = 200$ bootstrap replications. The results are stated in
Table 1.

We observe from the left part of Table 1 that the level of the test is
accurately approximated for most of the models, if the tail dependence is not
too strong.
Table 1

Simulated rejection probabilities of the test (4.7) for the null hypothesis of an
extreme-value copula for various models. The first four columns deal with models under
the null hypothesis, while the last four are from the alternative

H0-model         ρ      0.05    0.1      H1-model    τ      0.05    0.1
Independence     0      0.031   0.075    Clayton     0.25   0.874   0.916
Gumbel           0.25   0.045   0.098                0.5    1       1
                 0.5    0.029   0.066                0.75   0.999   1
                 0.75   0.025   0.065    Frank       0.25   0.291   0.396
Mixed model      0.25   0.043   0.09                 0.5    0.73    0.822
                 0.5    0.047   0.10                 0.75   0.783   0.898
Asy. Neg. Log.   0.25   0.041   0.09     Gaussian    0.25   0.168   0.240
                 0.5    0.038   0.077                0.5    0.237   0.336
                                                     0.75   0.084   0.156
Hüsler-Reiß      0.25   0.04    0.091
                 0.5    0.045   0.089    t4          0.25   0.105   0.187
                 0.75   0.009   0.053                0.5    0.158   0.263
                                                     0.75   0.046   0.092

For a large tail dependence coefficient the bootstrap test is conservative.
This phenomenon can be explained by the fact that for the limiting case of
random variables distributed according to the upper Fréchet-Hoeffding bound
the empirical copula $C_n$ does not converge weakly to a nondegenerate process
at a rate $1/\sqrt{n}$; rather, in this case it follows that $\|C_n - C\|_\infty = O(1/n)$.
Consequently, the approximations proposed in this paper, which are based on
the weak convergence of $\sqrt{n}(C_n - C)$ to a nondegenerate process, are not
appropriate for small samples if the tail dependence coefficient is large.
Considering the alternative, we observe reasonably good power for the Frank
and Clayton copulas, while for the Gaussian or $t$-copula deviations from an
extreme-value copula are not detected well with a sample size $n = 200$. In
some cases, the power of the test (4.7) is close to the nominal level. This
observation can again be explained by the closeness to the upper
Fréchet-Hoeffding bound.

Indeed, we can use the minimal distance $M_h(C, A^*)$ as a measure of
deviation from an extreme-value copula. Calculating the minimal distance
$M_h(C, A^*)$ (with Kendall's $\tau = 0.5$ and $h = h_{0.4}$), we observe that the
minimal distances are about ten times smaller for the Gaussian and $t_4$ copula
than for the Frank and Clayton copula, that is,

M,(C7, ^£i,yton) = 1-65 X 10-3, M,(C, = 5-87 x 10-^ 

A/4(C,^^,,,,i,J = 2.08 X 10-^ Mj^iCAl) = 1.18 x 10"^ 

Moreover, as explained in Remark 4.3(c) the power of the tests (4.2)
and (4.7) is an increasing function of the quantity
$p(\text{copula}) = M_h(C, A^*) \sigma^{-1}$. For the four copulas considered in the
simulation study (with $\tau = 0.5$), the corresponding ratios are approximately
given by

p(Clayton) = 0.230, p(Frank) = 0.134,
p(Gaussian) = 0.083, p($t_4$) = 0.064,

which provides some theoretical explanation of the findings presented in
Table 1. Loosely speaking, if the value $M_h(C, A^*) \sigma^{-1}$ is very small, a
larger sample size is required to detect a deviation from an extreme-value
copula. This statement is confirmed by further simulation results. For
example, for the Gaussian and $t_4$ copula (with Kendall's $\tau = 0.75$) we obtain
for the sample size $n = 500$ the rejection probabilities 0.766 (0.629) and
0.40 (0.544) for the bootstrap test with level 5% (10%), respectively.

APPENDIX A: PROOFS 

Proof of Theorem 3.1. Fix $\lambda > 1$ as in (3.11) and $\gamma \in (1/2, \lambda/2)$.
Due to Lemma 1.10.2(i) in Van der Vaart and Wellner (1996), the process
$\sqrt{n}(\tilde{C}_n - C)$ will have the same weak limit (with respect to the
$\rightsquigarrow$ convergence) as $\sqrt{n}(C_n - C)$.

For $i = 2, 3, \ldots$, we consider the following random functions in $l^\infty[0, 1]$:

$W_n(t) = \int_0^1 \sqrt{n} (\log \tilde{C}_n(y^{1-t}, y^t) - \log C(y^{1-t}, y^t)) \, w(y, t) \, dy,$

$W_{i,n}(t) = \int_{1/i}^1 \sqrt{n} (\log \tilde{C}_n(y^{1-t}, y^t) - \log C(y^{1-t}, y^t)) \, w(y, t) \, dy,$

$W(t) = \int_0^1 \frac{\mathbb{G}_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)} \, w(y, t) \, dy,$

$W_i(t) = \int_{1/i}^1 \frac{\mathbb{G}_C(y^{1-t}, y^t)}{C(y^{1-t}, y^t)} \, w(y, t) \, dy.$

We prove the theorem by an application of Theorem 4.2 in Billingsley (1968),
adapted to the concept of weak convergence in the sense of
Hoffmann-Jørgensen, see, for example, Van der Vaart and Wellner (1996). More
precisely, we will show in Lemma B.1 in Appendix B that the weak convergence
$W_n \rightsquigarrow W$ in $l^\infty[0, 1]$ follows from the following three assertions:

(A.1)
(i) For every $i \ge 2$: $W_{i,n} \rightsquigarrow W_i$ for $n \to \infty$ in $l^\infty[0, 1]$,
(ii) $W_i \rightsquigarrow W$ for $i \to \infty$ in $l^\infty[0, 1]$,
(iii) For every $\varepsilon > 0$:
$\lim_{i \to \infty} \limsup_{n \to \infty} P^*\left( \sup_{t \in [0,1]} |W_{i,n}(t) - W_n(t)| > \varepsilon \right) = 0.$
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The main part of the proof now consists in the verification assertion (iii) . 

We begin by proving assertion (i). For this purpose, set Tj = 1]^ and 
consider the mapping 

/h^logo/, 

where its domain B$j is defined by = {/ G l°°{Ti) : inf^eTj/(2;)| > 0} C 
l°°{Ti). By Lemma 12.2 in Kosorok (2008), it fohows that is Hadamard- 
differentiable at C, tangentially to l°°{Ti), with derivative ^'^^{f) = f/C. 

Since $\tilde C_n \ge n^{-\gamma}$ and $C \ge \Pi$ we have $\tilde C_n, C \in D_{\phi_1}$, and the functional delta
method [see Theorem 2.8 in Kosorok (2008)] yields

\[ \sqrt{n}\,(\log \tilde C_n - \log C) \rightsquigarrow \mathbb{G}_C / C \]

in $\ell^\infty(T_i)$. Next, we consider the operator

\[ \phi_2 : \ell^\infty(T_i) \to \ell^\infty([1/i,1]\times[0,1]), \qquad f \mapsto f \circ \psi, \]

where the mapping $\psi : [1/i,1]\times[0,1] \to T_i$ is defined by $\psi(y,t) = (y^{1-t}, y^t)$.
Observing

\[ \sup_{(y,t)\in[1/i,1]\times[0,1]} |f\circ\psi(y,t) - g\circ\psi(y,t)| \le \sup_{x\in T_i} |f(x) - g(x)| \]

we can conclude that $\phi_2$ is Lipschitz-continuous. By the continuous mapping
theorem [see, e.g., Theorem 7.7 in Kosorok (2008)] and conditions (3.9)
and (3.10), we immediately obtain

\[ \sqrt{n}\,\bigl(\log \tilde C_n(y^{1-t},y^t) - \log C(y^{1-t},y^t)\bigr)\, w(y,t) \rightsquigarrow \frac{\mathbb{G}_C(y^{1-t},y^t)}{C(y^{1-t},y^t)}\, w(y,t) \]

in $\ell^\infty([1/i,1]\times[0,1])$. The assertion in (i) now follows by continuity of
integration with respect to the variable $y$.

For the proof of assertion (ii), we simply note that $\mathbb{G}_C$ is bounded on
$[0,1]^2$ and that

\[ \frac{w(y,t)}{C(y^{1-t},y^t)} \]

is uniformly bounded with respect to $t \in [0,1]$ by the integrable function
$\bar K(y) = \bar w(y)\, y^{-1}$.
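The bound in assertion (ii) rests on the inequality $C(y^{1-t}, y^t) \ge y$, which holds whenever the copula dominates the independence copula. As a hedged illustration (the Gumbel family, the parameter value and all function names are assumptions chosen for this sketch, not part of the paper), the snippet below evaluates a Gumbel copula along the curve $(y^{1-t}, y^t)$, confirms that it equals $y^{A(t)}$ for the Gumbel Pickands function $A$, and checks that $y / C(y^{1-t}, y^t) \le 1$ on a grid.

```python
import numpy as np

def pickands_gumbel(t, theta):
    """Pickands dependence function of the Gumbel copula (illustrative choice)."""
    return (t ** theta + (1.0 - t) ** theta) ** (1.0 / theta)

def gumbel_copula(u, v, theta):
    """Gumbel copula C(u, v) = exp(-((-log u)^theta + (-log v)^theta)^(1/theta))."""
    s = (-np.log(u)) ** theta + (-np.log(v)) ** theta
    return np.exp(-(s ** (1.0 / theta)))

theta = 2.0
y = np.linspace(1e-6, 1.0, 200)[:, None]   # column of y values in (0, 1]
t = np.linspace(0.0, 1.0, 101)[None, :]    # row of t values in [0, 1]

on_curve = gumbel_copula(y ** (1.0 - t), y ** t, theta)
power_form = y ** pickands_gumbel(t, theta)

# Extreme-value identity along the curve: C(y^(1-t), y^t) = y^A(t).
identity_holds = bool(np.allclose(on_curve, power_form))
# Since A(t) <= 1 and y <= 1, the ratio y / C(y^(1-t), y^t) never exceeds one.
ratio_max = float(np.max(y / power_form))
print(identity_holds, ratio_max)
```

The ratio equals one at $t \in \{0, 1\}$, where $A(t) = 1$, which is why the factor $y^{-1}$ in the integrable bound cannot be improved uniformly in $t$.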

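For the proof of assertion (iii), choose some $\alpha \in (0, 1/2)$ such that $\lambda\alpha > \gamma$
and consider the decomposition

\[ W_n(t) - W_{i,n}(t) = \int_0^{1/i} \sqrt{n}\,\bigl(\log \tilde C_n(y^{1-t},y^t) - \log C(y^{1-t},y^t)\bigr)\, w(y,t)\,dy = B_i^{(1)}(t) + B_i^{(2)}(t), \tag{A.2} \]

where

\[ B_i^{(j)}(t) = \int_{I_{B_i^{(j)}}(t)} \sqrt{n}\,\log\frac{\tilde C_n}{C}(y^{1-t},y^t)\, w(y,t)\,dy, \qquad j = 1, 2, \tag{A.3} \]

and

\[ I_{B_i^{(1)}}(t) = \{ 0 < y < 1/i \mid C(y^{1-t},y^t) > n^{-\alpha} \}, \qquad I_{B_i^{(2)}}(t) = (0, 1/i) \setminus I_{B_i^{(1)}}(t). \tag{A.4} \]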



The usual estimate

\[ P^*\Bigl( \sup_{t\in[0,1]} |W_{i,n}(t) - W_n(t)| > \varepsilon \Bigr) \le P^*\Bigl( \sup_{t\in[0,1]} |B_i^{(1)}(t)| > \varepsilon/2 \Bigr) + P^*\Bigl( \sup_{t\in[0,1]} |B_i^{(2)}(t)| > \varepsilon/2 \Bigr) \tag{A.5} \]

allows for individual investigation of both expressions, and we begin with the
term $\sup_{t\in[0,1]} |B_i^{(1)}(t)|$. By the mean value theorem applied to the logarithm,
we have



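\[ \log\frac{\tilde C_n}{C}(y^{1-t},y^t) = \log \tilde C_n(y^{1-t},y^t) - \log C(y^{1-t},y^t) = \frac{(\tilde C_n - C)(y^{1-t},y^t)}{C^*(y,t)}, \tag{A.6} \]

where $C^*(y,t)$ is some intermediate point satisfying
$|C^*(y,t) - C(y^{1-t},y^t)| \le |\tilde C_n(y^{1-t},y^t) - C(y^{1-t},y^t)|$.
Especially, observing $C \ge \Pi$ we have

\[ C^*(y,t) \ge (C \wedge \tilde C_n)(y^{1-t},y^t) \ge y \Bigl( 1 \wedge \frac{\tilde C_n}{C}(y^{1-t},y^t) \Bigr) \tag{A.7} \]

and therefore

\[ \sup_{t\in[0,1]} |B_i^{(1)}(t)| \le \sup_{t\in[0,1]} \int_{I_{B_i^{(1)}}(t)} \sqrt{n}\,|(\tilde C_n - C)(y^{1-t},y^t)| \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) w(y,t)\, y^{-1}\,dy \]

\[ \le \sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)| \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(x) \Bigr| \Bigr) \times \psi(i) \]

with $\psi(i) = \int_0^{1/i} \bar w(y)\, y^{-1}\,dy = o(1)$ for $i \to \infty$. This yields for the first term
on the right-hand side of (A.5)

\[ P^*\Bigl( \sup_{t\in[0,1]} |B_i^{(1)}(t)| > \varepsilon \Bigr) \le P^*\Bigl( \sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)| > \sqrt{\varepsilon/\psi(i)} \Bigr) + P^*\Bigl( 1 \vee \sup_{C(x) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(x) \Bigr| > \sqrt{\varepsilon/\psi(i)} \Bigr). \tag{A.8} \]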

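Since $\sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)|$ is asymptotically tight, we immediately
obtain

\[ \lim_{i\to\infty} \limsup_{n\to\infty} P^*\Bigl( \sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)| > \sqrt{\varepsilon/\psi(i)} \Bigr) = 0. \tag{A.9} \]

For the estimation of the second term in (A.8), we note that

\[ \sup_{x\in[0,1]^2 : C(x) > n^{-\alpha}} \Bigl| \frac{\tilde C_n(x) - C(x)}{C(x)} \Bigr| \le n^{\alpha} \sup_{x\in[0,1]^2} |\tilde C_n(x) - C(x)| \xrightarrow{P^*} 0, \tag{A.10} \]

which in turn implies

\[ \sup_{C(x) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(x) \Bigr| = \sup_{C(x) > n^{-\alpha}} \Bigl| 1 + \frac{\tilde C_n - C}{C}(x) \Bigr|^{-1} \]

\[ \le \Bigl( 1 - \sup_{C(x) > n^{-\alpha}} \Bigl| \frac{\tilde C_n - C}{C}(x) \Bigr| \Bigr)^{-1} \mathbb{I}_{A_n} + \Bigl( \sup_{C(x) > n^{-\alpha}} \Bigl| 1 + \frac{\tilde C_n - C}{C}(x) \Bigr|^{-1} \Bigr) \mathbb{I}_{\Omega \setminus A_n}, \tag{A.11} \]

where $A_n = \{ \sup_{C(x) > n^{-\alpha}} | \frac{\tilde C_n - C}{C}(x) | < 1/2 \}$. This implies that the function
$\max\{ 1, \sup_{C(x) > n^{-\alpha}} | \frac{C}{\tilde C_n}(x) | \}$ can be bounded by a function that converges
to one in outer probability, and thus

\[ \lim_{i\to\infty} \limsup_{n\to\infty} P^*\Bigl( 1 \vee \sup_{C(x) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(x) \Bigr| > \sqrt{\varepsilon/\psi(i)} \Bigr) = 0. \]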



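Observing (A.8) and (A.9), it remains to estimate the second term on the
right-hand side of (A.5). We make use of the mean value theorem again
[see (A.6)] but use the estimate

\[ C^*(y,t) \ge (C \wedge \tilde C_n)(y^{1-t},y^t) \ge y^{\lambda} \Bigl( 1 \wedge \frac{\tilde C_n}{C^{\lambda}}(y^{1-t},y^t) \Bigr), \tag{A.12} \]

which yields

\[ \sup_{t\in[0,1]} |B_i^{(2)}(t)| \le \sup_{t\in[0,1]} \int_{I_{B_i^{(2)}}(t)} \sqrt{n}\,|(\tilde C_n - C)(y^{1-t},y^t)| \Bigl( 1 \vee \frac{C^{\lambda}}{\tilde C_n}(y^{1-t},y^t) \Bigr) w(y,t)\, y^{-\lambda}\,dy \]

\[ \le \sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)| \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} \Bigl| \frac{C^{\lambda}}{\tilde C_n}(x) \Bigr| \Bigr) \times \phi(i), \]

where $\phi(i) = \int_0^{1/i} \bar w(y)\, y^{-\lambda}\,dy = o(1)$ for $i \to \infty$ by condition (3.11). Using
analogous arguments as for the estimation of $\sup_{t\in[0,1]} |B_i^{(1)}(t)|$, the assertion
follows from

\[ \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} \Bigl| \frac{C^{\lambda}}{\tilde C_n}(x) \Bigr| \le \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} | n^{\gamma} C^{\lambda}(x) | \le n^{\gamma - \lambda\alpha} = o(1) \]

due to the choice of $\gamma$ and $\alpha$. □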



Proof of Theorem 3.4. The proof will also be based on Lemma B.1
in Appendix B, verifying conditions (i)-(iii) in (A.1). A careful inspection
of the previous proof shows that the verification of condition (i) in (A.1)
remains valid. Regarding condition (ii), we have to show that the process
$\frac{\mathbb{G}_C}{C}(y^{1-t},y^t)$ is integrable on the interval $(0, 1)$. For this purpose, we write

\[ \mathbb{G}_C(x) = \mathbb{B}_C(x) - \partial_1 C(x)\,\mathbb{B}_C(x_1, 1) - \partial_2 C(x)\,\mathbb{B}_C(1, x_2) \]

and consider each term separately. From Theorem G.1 in Genest and Segers
(2009), we know that for any $\omega \in (0, 1/2)$ the process

\[ \tilde{\mathbb{B}}_C(x) = \begin{cases} \dfrac{\mathbb{B}_C(x)}{(x_1 \wedge x_2)^{\omega} (1 - x_1 \wedge x_2)^{\omega}}, & \text{if } x_1 \wedge x_2 \in (0, 1), \\ 0, & \text{if } x_1 = 0 \text{ or } x_2 = 0 \text{ or } x = (1, 1), \end{cases} \]

has continuous sample paths on $[0, 1]^2$. Considering $C(y^{1-t},y^t) \ge y$ and using
the notation

\[ K_1(y,t) = q_{\omega}(y^{1-t} \wedge y^t)\, y^{-1}, \tag{A.13} \]

\[ K_2(y,t) = \partial_1 C(y^{1-t},y^t)\, q_{\omega}(y^{1-t})\, y^{-1}, \tag{A.14} \]

\[ K_3(y,t) = \partial_2 C(y^{1-t},y^t)\, q_{\omega}(y^t)\, y^{-1} \tag{A.15} \]

with $q_{\omega}(t) = t^{\omega}(1 - t)^{\omega}$, it remains to show that there exist integrable functions
$K_j^*(y)$ with $K_j(y,t) \le K_j^*(y)$ for all $t \in [0, 1]$ $(j = 1, 2, 3)$. For $K_1$ this
is immediate because $K_1(y,t) \le (y^{1-t} \wedge y^t)^{\omega}\, y^{-1} \le y^{\omega/2 - 1}$. For $K_2$, note
that $\partial_1 C(y^{1-t},y^t) = y^{A(t) - (1-t)} \mu(t)$ with $\mu(t) = A(t) - tA'(t)$. Therefore,

\[ K_2(y,t) \le \mu(t)\, y^{A(t) + (\omega - 1)(1 - t) - 1} \le \mu(t)\, y^{\omega/2 - 1} \le 2 y^{\omega/2 - 1}, \tag{A.16} \]

where the second estimate follows from the inequality $t \vee (1 - t) \le A(t) \le 1$
and holds for $\omega \in (0, 2)$. A similar argument works for the term $K_3$.
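The exponent inequality behind the second estimate in (A.16) can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses the Gumbel Pickands function $A(t) = (t^{\theta} + (1-t)^{\theta})^{1/\theta}$ as a test case, and the parameter grids and helper names are hypothetical. It verifies on a grid that $A(t) - (1 - t) + \omega(1 - t) \ge \omega/2$ and that $\mu(t) = A(t) - tA'(t)$ stays below the constant 2 used in the bound.

```python
import numpy as np

def pickands_gumbel(t, theta):
    """Gumbel Pickands function, used here only as a convenient test case."""
    return (t ** theta + (1.0 - t) ** theta) ** (1.0 / theta)

def pickands_gumbel_deriv(t, theta):
    """Closed-form derivative A'(t) of the Gumbel Pickands function."""
    base = t ** theta + (1.0 - t) ** theta
    return base ** (1.0 / theta - 1.0) * (t ** (theta - 1.0) - (1.0 - t) ** (theta - 1.0))

t = np.linspace(0.0, 1.0, 1001)
worst_margin = np.inf   # smallest value of exponent - omega/2 over all settings
mu_max = -np.inf        # largest value of mu(t) over all settings

for theta in (1.0, 2.0, 5.0):
    a = pickands_gumbel(t, theta)
    mu = a - t * pickands_gumbel_deriv(t, theta)
    mu_max = max(mu_max, float(mu.max()))
    for omega in (0.30, 0.40, 0.49):
        # Exponent of y in the first bound for K_2, before subtracting one.
        exponent = a - (1.0 - t) + omega * (1.0 - t)
        worst_margin = min(worst_margin, float((exponent - omega / 2.0).min()))

print(worst_margin, mu_max)
```

For this family the margin shrinks toward zero near $t = 1/2$ as $\theta$ grows, because $A(1/2)$ approaches $1/2$, which matches the role of the lower bound $t \vee (1 - t) \le A(t)$ in the argument.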

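For the verification of condition (iii), we proceed along similar lines as in
the previous proof. We begin by choosing some $\beta \in (1, 9/8)$, $\omega \in (1/4, 1/2)$
and some $\alpha \in (4/9, \gamma \wedge (2 - \omega)^{-1})$ in such a way that $\gamma < \beta\alpha$. First, note
that $y < 1/(n + 2)^2$ implies $\tilde C_n(y^{1-t},y^t) = n^{-\gamma}$ for all $t \in [0, 1]$. This yields

\[ \int_0^{(n+2)^{-2}} \sqrt{n}\,(\log \tilde C_n - \log C)(y^{1-t},y^t)\,dy = O\Bigl( \frac{\log n}{n^{3/2}} \Bigr) \]

uniformly with respect to $t \in [0, 1]$, and therefore it is sufficient to consider
the decomposition in (A.2) with the sets

\[ I_{B_i^{(1)}}(t) = \{ 1/(n + 2)^2 < y < 1/i \mid C(y^{1-t},y^t) > n^{-\alpha} \}, \]

\[ I_{B_i^{(2)}}(t) = (1/(n + 2)^2, 1/i) \setminus I_{B_i^{(1)}}(t). \]

We can estimate the term $B_i^{(1)}(t)$ analogously to the previous proof by

\[ |B_i^{(1)}(t)| \le \int_{I_{B_i^{(1)}}(t)} \sqrt{n}\,|(\tilde C_n - C)(y^{1-t},y^t)| \times \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) y^{-1}\,dy. \]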

Let $H_n$ denote the empirical distribution function of the standardized sample
$(F(X_1), G(Y_1)), \ldots, (F(X_n), G(Y_n))$. By the results in Segers [(2010),
Section 5] we can decompose $\sqrt{n}(\tilde C_n - C) = \sqrt{n}(\hat C_n \vee n^{-\gamma} - C)$ as follows:

\[ \sqrt{n}(\tilde C_n - C)(x) = \sqrt{n}(\hat C_n - C)(x) + \sqrt{n}(\tilde C_n - \hat C_n)(x) = \alpha_n(x) - \partial_1 C(x)\,\alpha_n(x_1, 1) - \partial_2 C(x)\,\alpha_n(1, x_2) + \tilde R_n(x), \tag{A.17} \]

where $\alpha_n(x) = \sqrt{n}(H_n - C)(x)$ and the remainder satisfies

\[ \sup_{x\in[0,1]^2} |\tilde R_n(x)| = O\bigl( n^{1/2 - \gamma} + n^{-1/4} (\log n)^{1/2} (\log\log n)^{1/4} \bigr) \quad \text{a.s.} \tag{A.18} \]

Note that the estimate of (A.18) requires validity of condition 5.1 in Segers
(2010). This condition is satisfied provided that the function $A$ is assumed
to be twice continuously differentiable; see Example 6.3 in Segers (2010).

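With (A.17), we can estimate the term $|B_i^{(1)}(t)|$ analogously to decomposition
(A.2) by $B_{i,1}^{(1)}(t) + \cdots + B_{i,4}^{(1)}(t)$, where

\[ B_{i,1}^{(1)}(t) = \int_{I_{B_i^{(1)}}(t)} |\alpha_n(y^{1-t},y^t)| \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) y^{-1}\,dy, \]

\[ B_{i,2}^{(1)}(t) = \int_{I_{B_i^{(1)}}(t)} \partial_1 C(y^{1-t},y^t)\,|\alpha_n(y^{1-t}, 1)| \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) y^{-1}\,dy, \]

\[ B_{i,3}^{(1)}(t) = \int_{I_{B_i^{(1)}}(t)} \partial_2 C(y^{1-t},y^t)\,|\alpha_n(1, y^t)| \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) y^{-1}\,dy, \]

\[ B_{i,4}^{(1)}(t) = \int_{I_{B_i^{(1)}}(t)} |\tilde R_n(y^{1-t},y^t)| \Bigl( 1 \vee \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr) y^{-1}\,dy. \]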

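The decomposition in (A.17), Theorem G.1 in Genest and Segers (2009) and
the inequality $\alpha < \gamma \wedge (2 - \omega)^{-1}$ may be used to conclude

\[ \sup_{(y,t) : C(y^{1-t},y^t) > n^{-\alpha}} \Bigl| \frac{\tilde C_n - C}{C}(y^{1-t},y^t) \Bigr| = o_{P^*}(1), \]

which in turn implies

\[ 1 \vee \sup_{(y,t) : C(y^{1-t},y^t) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr| = O_{P^*}(1) \tag{A.19} \]

analogously to (A.11). Together with (A.18) and observing the inequality
$\int_{(n+2)^{-2}}^{1/i} y^{-1}\,dy \le 2\log(n + 2)$, we obtain, for $n \to \infty$,

\[ \sup_{t\in[0,1]} B_{i,4}^{(1)}(t) = O_{P^*}\bigl( n^{1/2 - \gamma} \log n + n^{-1/4} (\log n)^{3/2} (\log\log n)^{1/4} \bigr) = o_{P^*}(1), \]

which implies

\[ \lim_{i\to\infty} \limsup_{n\to\infty} P^*\Bigl( \sup_{t\in[0,1]} B_{i,4}^{(1)}(t) > \varepsilon/4 \Bigr) = 0. \tag{A.20} \]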

Observing that $q_{\omega}(y^{1-t} \wedge y^t) \le y^{\omega/2}$, the first term $B_{i,1}^{(1)}(t)$ can be estimated
by

\[ \sup_{t\in[0,1]} B_{i,1}^{(1)}(t) \le \sup_{x\in[0,1]^2} \frac{|\alpha_n(x)|}{q_{\omega}(x_1 \wedge x_2)} \times \Bigl( 1 \vee \sup_{(y,t) : C(y^{1-t},y^t) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr| \Bigr) \times \psi(i), \]

where $\psi(i) = \int_0^{1/i} y^{-1 + \omega/2}\,dy = o(1)$ for $i \to \infty$. Using analogous arguments
as in the previous proof we can conclude, using (A.19) and Theorem G.1
in Genest and Segers (2009), that
$\lim_{i\to\infty} \limsup_{n\to\infty} P^*( \sup_{t\in[0,1]} B_{i,1}^{(1)}(t) > \varepsilon/4 ) = 0$.
For the second summand, we note that

\[ \sup_{t\in[0,1]} B_{i,2}^{(1)}(t) \le \sup_{x_1\in[0,1]} \frac{|\alpha_n(x_1, 1)|}{q_{\omega}(x_1)} \times \Bigl( 1 \vee \sup_{(y,t) : C(y^{1-t},y^t) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(y^{1-t},y^t) \Bigr| \Bigr) \times \sup_{t\in[0,1]} \int_0^{1/i} K_2(y,t)\,dy, \]

where $K_2(y,t)$ is defined in (A.14). Observing the estimate in (A.16), we
easily obtain $\lim_{i\to\infty} \sup_{t\in[0,1]} \int_0^{1/i} K_2(y,t)\,dy = 0$. Again, under consideration
of (A.19) and Theorem G.1 in Genest and Segers (2009), we have
$\lim_{i\to\infty} \limsup_{n\to\infty} P^*( \sup_{t\in[0,1]} B_{i,2}^{(1)}(t) > \varepsilon/4 ) = 0$. A similar argument works
for $B_{i,3}^{(1)}$, and from the estimates for the different terms the assertion

\[ \lim_{i\to\infty} \limsup_{n\to\infty} P^*\Bigl( \sup_{t\in[0,1]} |B_i^{(1)}(t)| > \varepsilon \Bigr) = 0 \]

follows. Considering the term $\sup_{t\in[0,1]} |B_i^{(2)}(t)|$, we proceed along similar
lines as in the proof of Theorem 3.1. For the sake of brevity, we only state
the important differences: in estimation (A.12) replace $\lambda$ by $\beta$, then make use
of decomposition (A.17), calculations similar to (A.16), and Theorem G.1
in Genest and Segers (2009) again, and for the estimation of the remainder
note that $\int_{(n+2)^{-2}}^{1/i} y^{-\beta}\,dy = O(n^{2(\beta - 1)})$. □

Proof of Theorem 3.8. Let $\eta$ denote a probability measure minimizing
the functional $V$ defined in (3.19). Note that $V$ is convex and define for
$\alpha \in [0, 1]$ and a further probability measure $\xi$ on $[0, 1]$ the function

\[ g(\alpha) = V(\alpha\xi + (1 - \alpha)\eta). \]

Because $V$ is convex, it follows that $\eta$ is optimal if and only if the directional
derivative of $V$ at $\eta$ in the direction $\xi - \eta$ satisfies

\[ 0 \le g'(0+) = \lim_{\alpha\to 0+} \frac{g(\alpha) - g(0)}{\alpha} = 2 \int_0^1 \int_0^1 k_t(x,y)\,d\xi(x)\,d\eta(y) - 2 \int_0^1 \int_0^1 k_t(x,y)\,d\eta(x)\,d\eta(y) \]

for all probability measures $\xi$. Using Dirac measures for $\xi$ yields that this
inequality is equivalent to (3.20), which proves Theorem 3.8. □
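The directional-derivative formula in this proof can be illustrated on a discretized problem. The sketch below is hedged: it assumes that the functional has the quadratic form $V(\eta) = \int\int k_t(x,y)\,d\eta(x)\,d\eta(y)$ with a symmetric kernel, which is what the displayed derivative suggests but which is defined in (3.19) outside this section. The kernel matrix `K`, the weight vectors and the function names are hypothetical. It compares the closed-form derivative $2\xi^{\top}K\eta - 2\eta^{\top}K\eta$ with a one-sided finite difference of $g$.

```python
import numpy as np

rng = np.random.default_rng(1)
m = 6                                   # number of support points (illustrative)
B = rng.normal(size=(m, m))
K = B @ B.T                             # symmetric positive semidefinite kernel matrix
eta = rng.dirichlet(np.ones(m))         # discrete probability measure eta
xi = rng.dirichlet(np.ones(m))          # competing probability measure xi

def V(p):
    """Assumed quadratic functional V(p) = p^T K p, convex because K is PSD."""
    return float(p @ K @ p)

def g(a):
    """Restriction of V to the segment from eta (a = 0) to xi (a = 1)."""
    return V(a * xi + (1.0 - a) * eta)

# Closed-form one-sided derivative at a = 0, matching the display in the proof.
analytic = float(2.0 * xi @ K @ eta - 2.0 * eta @ K @ eta)
# Forward finite difference as an independent numerical check.
h = 1e-6
numeric = (g(h) - g(0.0)) / h
print(analytic, numeric)
```

Taking `xi` to be a unit vector corresponds to the Dirac measures used in the last step of the proof; in the discrete setting the optimality condition then reads $(K\eta)_j \ge \eta^{\top}K\eta$ for every index $j$.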

Proof of Theorem 4.1. Since the integration mapping is continuous,
it suffices to establish the weak convergence $W_n(t) \rightsquigarrow W(t)$ in $\ell^\infty[0, 1]$, where
we define

\[ W_n(t) = \int_0^1 n \Bigl( \log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \Bigr)^2 w(y)\,dy - n B_h \bigl( \hat A_{n,h}(t) - A^*(t) \bigr)^2. \]

We prove this assertion along similar lines as in the proof of Theorem 3.1. For
$i \ge 2$, we recall the notation $w(y) = h^*(y)/(\log y)^2$ and consider the following
random functions in $\ell^\infty[0, 1]$:

\[ W_{i,n}(t) = \int_{1/i}^1 n \Bigl( \log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \Bigr)^2 w(y)\,dy - B_h^{-1} \Bigl( \int_{1/i}^1 \sqrt{n}\,\log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)}\,\frac{h^*(y)}{\log y}\,dy \Bigr)^2. \]

By an application of Lemma B.1 in Appendix B, it suffices to show the
conditions listed in (A.1). By arguments similar to those in the proof of
Theorem 3.1, we obtain

\[ \sqrt{n}\,\log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \rightsquigarrow \frac{\mathbb{G}_C(y^{1-t},y^t)}{C(y^{1-t},y^t)} \tag{A.21} \]

in $\ell^\infty([1/i, 1] \times [0, 1])$. Assertion (i) now follows immediately by the boundedness
of the functions $w(y)$ and $h^*(y)(-\log y)^{-1}$ on $[1/i, 1]$ [see conditions
(3.9), (3.10) and (3.13)] and the continuous mapping theorem.

For the proof of assertion (ii), we simply note that $\mathbb{G}_C^2$ and $\mathbb{G}_C$ are
bounded on $[0, 1]^2$ and that $K_1(y,t) = \frac{w(y)}{C^2(y^{1-t},y^t)}$ and $K_2(y,t) = \frac{h^*(y)}{(-\log y)\,C(y^{1-t},y^t)}$ are
bounded uniformly with respect to $t \in [0, 1]$ by the integrable functions
$\bar K_1(y) = w(y)\,y^{-2}$ and $\bar K_2(y) = h^*(y)(-\log y)^{-1} y^{-1}$.

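For the proof of assertion (iii), we fix some $\alpha \in (0, 1/2)$ such that $\lambda\alpha > 2\gamma$
and consider the decomposition

\[ W_n(t) - W_{i,n}(t) = B_i^{(1)}(t) + B_i^{(2)}(t) + B_i^{(3)}(t), \tag{A.22} \]

where

\[ B_i^{(1)}(t) = \int_{I_{B_i^{(1)}}(t)} n \log^2\frac{\tilde C_n}{C}(y^{1-t},y^t)\, w(y)\,dy, \tag{A.23} \]

\[ B_i^{(2)}(t) = \int_{I_{B_i^{(2)}}(t)} n \log^2\frac{\tilde C_n}{C}(y^{1-t},y^t)\, w(y)\,dy, \tag{A.24} \]

\[ B_i^{(3)}(t) = -B_h^{-1}\, I(t, 1/i)\,\bigl( 2 I(t, 1) - I(t, 1/i) \bigr), \tag{A.25} \]

$I_{B_i^{(1)}}(t)$ and $I_{B_i^{(2)}}(t)$ are defined in (A.4) and

\[ I(t, a) = \int_0^a \sqrt{n}\,\log\Bigl( \frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \Bigr) \frac{h^*(y)}{\log y}\,dy. \]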
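The form of the third term in (A.25) is a difference of squares. As a short worked step (a sketch under the assumption, inferred from the definitions of $W_n$ and $I(t,a)$ rather than stated in this section, that $n B_h(\hat A_{n,h}(t) - A^*(t))^2 = B_h^{-1} I(t,1)^2$ and that the corresponding term of the truncated process is $B_h^{-1}(I(t,1) - I(t,1/i))^2$):

```latex
\begin{align*}
  -B_h^{-1}\Bigl[ I(t,1)^2 - \bigl( I(t,1) - I(t,1/i) \bigr)^2 \Bigr]
    &= -B_h^{-1}\Bigl[ 2\, I(t,1)\, I(t,1/i) - I(t,1/i)^2 \Bigr] \\
    &= -B_h^{-1}\, I(t,1/i)\, \bigl( 2\, I(t,1) - I(t,1/i) \bigr).
\end{align*}
```

This makes explicit why it is enough to control $\sup_t |I(t,1/i)|$ for large $i$ together with the stochastic boundedness of $\sup_t |I(t,1)|$.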

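By the same arguments as in the proof of Theorem 3.1, we have for every
$\varepsilon > 0$

\[ \lim_{i\to\infty} \limsup_{n\to\infty} P^*\Bigl( \sup_{t\in[0,1]} |I(t, 1/i)| > \varepsilon \Bigr) = 0, \]

and $\sup_{t\in[0,1]} |I(t, 1)| = O_{P^*}(1)$, which yields the asymptotic negligibility
$\lim_{i\to\infty} \limsup_{n\to\infty} P^*( \sup_{t\in[0,1]} |B_i^{(3)}(t)| > \varepsilon ) = 0$. For $B_i^{(1)}(t)$, we obtain the
estimate

\[ \sup_{t\in[0,1]} |B_i^{(1)}(t)| \le \sup_{t\in[0,1]} \int_{I_{B_i^{(1)}}(t)} n\,|(\tilde C_n - C)(y^{1-t},y^t)|^2 \Bigl( 1 \vee \frac{C^2}{\tilde C_n^2}(y^{1-t},y^t) \Bigr) w(y)\, y^{-2}\,dy \]

\[ \le \sup_{x\in[0,1]^2} n\,|\tilde C_n(x) - C(x)|^2 \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) > n^{-\alpha}} \Bigl| \frac{C^2}{\tilde C_n^2}(x) \Bigr| \Bigr) \times \psi(i), \]

where $\psi(i) := \int_0^{1/i} w(y)\, y^{-2}\,dy$, which can be handled by the same arguments
as in the proof of Theorem 3.1. Finally, the term $B_i^{(2)}(t)$ can be estimated
by

\[ \sup_{t\in[0,1]} |B_i^{(2)}(t)| \le \sup_{t\in[0,1]} \int_{I_{B_i^{(2)}}(t)} n\,|(\tilde C_n - C)(y^{1-t},y^t)|^2 \Bigl( 1 \vee \frac{C^{\lambda}}{\tilde C_n^2}(y^{1-t},y^t) \Bigr) w(y)\, y^{-\lambda}\,dy \]

\[ \le \sup_{x\in[0,1]^2} n\,|\tilde C_n(x) - C(x)|^2 \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} \Bigl| \frac{C^{\lambda}}{\tilde C_n^2}(x) \Bigr| \Bigr) \times \phi(i), \]

where $\phi(i) = \int_0^{1/i} w(y)\, y^{-\lambda}\,dy = o(1)$ for $i \to \infty$ by condition (3.11). Mimicking
the arguments from the proof of Theorem 3.1 completes the proof. □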



Proof of Theorem 4.2. Recall the decomposition
$M_h(\tilde C_n, \hat A_{n,h}) - M_h(C, A^*) = S_1 + S_2 + S_3$, where $S_1$, $S_2$ and $S_3$ are defined in (4.1). With
the notation $\bar v(y) := 2h^*(y)/(-\log y)$ it follows that $|v(y,t)| \le \bar v(y)$, and the
assumptions on $h$ yield the validity of (3.9)-(3.11) for $v(y,t)$. This allows for
an application of Theorem 3.1 and together with the continuous mapping
theorem we obtain $\sqrt{n}\,S_1 \rightsquigarrow Z_1$, where $Z_1$ is the limiting process defined
in (4.2). Thus, it remains to verify the negligibility of $S_2 + S_3$. For $S_3$, we
note that by Theorem 3.2 and the continuous mapping theorem we have
$S_3 = O_{P^*}(1/n)$, and it remains to consider $S_2$. To this end, we fix some
$\alpha \in (0, 1/2)$ such that $(1 + (\lambda - 1)/2)\alpha > \gamma$ and consider the decomposition

\[ \int_0^1 \log^2\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)}\,\frac{h^*(y)}{(\log y)^2}\,dy = \int_{I_{B_1^{(1)}}(t)} \log^2\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)}\,\frac{h^*(y)}{(\log y)^2}\,dy + \int_{I_{B_1^{(2)}}(t)} \log^2\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)}\,\frac{h^*(y)}{(\log y)^2}\,dy =: T_1(t, n) + T_2(t, n), \]

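where the sets $I_{B_1^{(j)}}(t)$, $j = 1, 2$, are defined in (A.4). On the set $I_{B_1^{(1)}}(t)$, we
use the estimate

\[ \log^2\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \le \frac{|\tilde C_n - C|^2}{(C^*)^2}(y^{1-t},y^t) \le n^{\alpha}\,\frac{|\tilde C_n - C|^2}{C^*}\,\frac{1}{1 \wedge \tilde C_n/C}(y^{1-t},y^t) \le n^{\alpha}\,|\tilde C_n - C|^2(y^{1-t},y^t)\; y^{-1} \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) > n^{-\alpha}} \frac{C(x)}{\tilde C_n(x)} \Bigr)^2, \]

where $|C^*(y,t) - C(y^{1-t},y^t)| \le |\tilde C_n(y^{1-t},y^t) - C(y^{1-t},y^t)|$. By arguments
similar to those used in the proof of Theorem 3.1, it is now easy to see that

\[ \sqrt{n}\,\sup_{t} |T_1(t, n)| \le \sup_{x\in[0,1]^2} n^{\alpha + 1/2}\,|\tilde C_n(x) - C(x)|^2 \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) > n^{-\alpha}} \Bigl| \frac{C}{\tilde C_n}(x) \Bigr| \Bigr)^2 \times K = o_{P^*}(1), \]

where $K := \int_0^1 w(y)\, y^{-1}\,dy < \infty$ denotes a finite constant [see condition (3.11)].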
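Now set $\beta := (\lambda - 1)/2 > 0$. From the estimate

\[ C^*(y,t) \ge y^{1+\beta} \Bigl( 1 \wedge \frac{\tilde C_n}{C^{1+\beta}}(y^{1-t},y^t) \Bigr) = y^{(1+\lambda)/2} \Bigl( 1 \wedge \frac{\tilde C_n}{C^{1+\beta}}(y^{1-t},y^t) \Bigr) \]

we obtain by similar arguments as in the proof of the negligibility of $|B_i^{(2)}(t)|$
in the proof of Theorem 3.1 (note that on $I_{B_1^{(2)}}(t)$ we have
$y \le C(y^{1-t},y^t) \le n^{-\alpha}$)

\[ \sqrt{n}\,\sup_{t\in[0,1]} |T_2(t, n)| \le \log(n)\, n^{-\beta\alpha} \sup_{x\in[0,1]^2} \sqrt{n}\,|\tilde C_n(x) - C(x)| \times \Bigl( 1 \vee \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} \Bigl| \frac{C^{1+\beta}}{\tilde C_n}(x) \Bigr| \Bigr) \times \tilde K, \]

where $\tilde K := \gamma \int_0^1 (1 - \log y)\,\frac{h^*(y)}{(\log y)^2}\, y^{-\lambda}\,dy$ denotes a finite constant [see conditions
(3.11) and (3.14)] and we used the estimate

\[ \log^2\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \le (\gamma \log n - \log y)\,\Bigl| \log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \Bigr| \le \gamma \log(n)\,(1 - \log y)\,\Bigl| \log\frac{\tilde C_n(y^{1-t},y^t)}{C(y^{1-t},y^t)} \Bigr|, \]

which holds for sufficiently large $n$. Finally, we observe that

\[ \sup_{x\in[0,1]^2 : C(x) \le n^{-\alpha}} \Bigl| \frac{C^{1+\beta}}{\tilde C_n}(x) \Bigr| \le \sup_{x : C(x) \le n^{-\alpha}} | n^{\gamma} C^{1+\beta}(x) | \le n^{\gamma - (1+\beta)\alpha} = o(1). \]

Now the proof is complete. □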

Proof of Theorem 4.4. The conditions on the weight function imply
that all integrals in the definition of $Z_0$ are proper and therefore the mapping
$(\mathbb{G}_C, C) \mapsto Z_0(\mathbb{G}_C, C)$ is continuous. Hence, the result follows by the continuous
mapping theorem for the bootstrap [see, e.g., Theorem 10.8 in Kosorok
(2008)] provided the conditional weak convergence in (4.4) holds under the
nonrestrictive smoothness assumption (3.6). To see this, proceed similarly as
in Bücher and Dette (2010) and show Hadamard-differentiability of the mapping
$H \mapsto H(H_1^{-}, H_2^{-})$, which is defined for some distribution function $H$
on the unit square whose marginals $H_1 = H(\cdot, 1)$ and $H_2 = H(1, \cdot)$ satisfy
$H_1(0) = H_2(0) = 0$. This can be done by similar arguments as in Segers
(2010) and the details are omitted for the sake of brevity. □



APPENDIX B: AN AUXILIARY RESULT 

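Lemma B.1. Let $X_n, X_{i,n} : \Omega \to D$ for $i, n \in \mathbb{N}$ be arbitrary maps with
values in the metric space $(D, d)$ and $X_i, X : \Omega \to D$ be Borel-measurable.
Suppose that:

(i) For every $i \in \mathbb{N}$: $X_{i,n} \rightsquigarrow X_i$ for $n \to \infty$,

(ii) $X_i \rightsquigarrow X$ for $i \to \infty$,

(iii) For every $\varepsilon > 0$: $\lim_{i\to\infty} \limsup_{n\to\infty} P^*( d(X_{i,n}, X_n) > \varepsilon ) = 0$.

Then $X_n \rightsquigarrow X$ for $n \to \infty$.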
Proof. Let $F \subset D$ be closed and fix $\varepsilon > 0$. If $F^{\varepsilon} = \{ x \in D : d(x, F) \le \varepsilon \}$
denotes the $\varepsilon$-enlargement of $F$, we obtain

\[ P^*(X_n \in F) \le P^*(X_{i,n} \in F^{\varepsilon}) + P^*( d(X_{i,n}, X_n) > \varepsilon ). \]

By hypothesis (i) and the Portmanteau theorem [see Van der Vaart and
Wellner (1996)]

\[ \limsup_{n\to\infty} P^*(X_n \in F) \le P(X_i \in F^{\varepsilon}) + \limsup_{n\to\infty} P^*( d(X_{i,n}, X_n) > \varepsilon ). \]

By conditions (ii) and (iii), $\limsup_{n\to\infty} P^*(X_n \in F) \le P(X \in F^{\varepsilon})$, and since
$F^{\varepsilon} \downarrow F$ for $\varepsilon \downarrow 0$ and closed $F$, the result follows by the Portmanteau theorem.
□

Acknowledgments. The authors would like to thank Martina Stein, who 
typed parts of this manuscript with considerable technical expertise. The 
authors would also like to thank Christian Genest for pointing out important 
references and Johan Segers for many fruitful discussions on the subject. We 
are also grateful to two unknown referees and an Associate Editor for their 
constructive comments on an earlier version of this manuscript, which led 
to a substantial improvement of the paper. 

REFERENCES 

Ben Ghorbal, N., Genest, C. and Nešlehová, J. (2009). On the Ghoudi, Khoudraji,
and Rivest test for extreme-value dependence. Canad. J. Statist. 37 534-552.
MR2588948

Billingsley, P. (1968). Convergence of Probability Measures. Wiley, New York.
MR0233396

Bücher, A. and Dette, H. (2010). A note on bootstrap approximations for the empirical
copula process. Statist. Probab. Lett. 80 1925-1932.

Bücher, A., Dette, H. and Volgushev, S. (2010). New estimators of the Pickands
dependence function and a test for extreme-value dependence. Available at
http://www.ruhr-uni-bochum.de/mathematik3/research/index.html.

Capéraà, P., Fougères, A. L. and Genest, C. (1997). A nonparametric estimation
procedure for bivariate extreme value copulas. Biometrika 84 567-577. MR1603985

Cebrian, A., Denuit, M. and Lambert, P. (2003). Analysis of bivariate tail dependence
using extreme values copulas: An application to the SOA medical large claims database.
Belgian Actuarial Journal 3 33-41.

Coles, S., Heffernan, J. and Tawn, J. (1999). Dependence measures for extreme value
analyses. Extremes 2 339-365.

Deheuvels, P. (1984). Probabilistic aspects of multivariate extremes. In Statistical
Extremes and Applications (J. T. de Oliveira, ed.) 117-130. Reidel, Dordrecht.
MR0784817

Deheuvels, P. (1991). On the limiting behavior of the Pickands estimator for bivariate
extreme-value distributions. Statist. Probab. Lett. 12 429-439. MR1142097

Fermanian, J.-D., Radulovic, D. and Wegkamp, M. (2004). Weak convergence of
empirical copula processes. Bernoulli 10 847-860. MR2093613



Fils-Villetard, A., Guillou, A. and Segers, J. (2008). Projection estimators of
Pickands dependence functions. Canad. J. Statist. 36 369-382. MR2456011

Genest, C. and Segers, J. (2009). Rank-based inference for bivariate extreme-value
copulas. Ann. Statist. 37 2990-3022. MR2541453

Ghoudi, K., Khoudraji, A. and Rivest, L. (1998). Propriétés statistiques des copules
de valeurs extrêmes bidimensionnelles. Canad. J. Statist. 26 187-197. MR1624413

Gumbel, E. J. (1960). Distributions des valeurs extrêmes en plusieurs dimensions. Publ.
Inst. Statist. Univ. Paris 9 171-173. MR0119279

Hall, P. and Tajvidi, N. (2000). Distribution and dependence-function estimation for
bivariate extreme-value distributions. Bernoulli 6 835-844. MR1791904

Hsing, T. (1989). Extreme value theory for multivariate stationary sequences. J. Multivariate
Anal. 29 274-291. MR1004339

Hüsler, J. and Reiss, R.-D. (1989). Maxima of normal random vectors: Between independence
and complete dependence. Statist. Probab. Lett. 7 283-286. MR0980699

Jiménez, J. R., Villa-Diharce, E. and Flores, M. (2001). Nonparametric estimation
of the dependence function in bivariate extreme value distributions. J. Multivariate
Anal. 76 159-191. MR1821817

Joe, H. (1990). Families of min-stable multivariate exponential and multivariate extreme
value distributions. Statist. Probab. Lett. 9 75-81. MR1035994

Karatzoglou, A., Smola, A., Hornik, K. and Zeileis, A. (2004). Kernlab, an S4
package for Kernel methods in R. Journal of Statistical Software 11 1-20.

Kojadinovic, I. and Yan, J. (2010). Nonparametric rank-based tests of bivariate
extreme-value dependence. J. Multivariate Anal. 101 2234-2249. MR2671214

Kosorok, M. R. (2008). Introduction to Empirical Processes and Semiparametric Inference.
Springer, New York. MR2724368

Marshall, A. W. (1970). Discussion of Barlow and van Zwet's papers. In Nonparametric
Techniques in Statistical Inference (M. L. Puri, ed.) 175-176. Cambridge Univ. Press,
London. MR0273755

Nelsen, R. B. (2006). An Introduction to Copulas, 2nd ed. Springer, New York.
MR2197664

Nocedal, J. and Wright, S. J. (2006). Numerical Optimization, 2nd ed. Springer, New
York. MR2244940

Pickands, J. (1981). Multivariate extreme value distributions (with a discussion). In
Proceedings of the 43rd Session of the International Statistical Institute. Bull. Inst.
Internat. Statist. 49 859-878, 894-902. MR0820979

Robertson, T., Wright, F. T. and Dykstra, R. L. (1996). Ordered Restricted Statistical
Inference. Wiley, New York.

Rüschendorf, L. (1976). Asymptotic distributions of multivariate rank order statistics.
Ann. Statist. 4 912-923. MR0420794

Scaillet, O. (2005). A Kolmogorov-Smirnov type test for positive quadrant dependence.
Canad. J. Statist. 33 415-427. MR2193983

Segers, J. (2007). Nonparametric inference for bivariate extreme-value copulas. In Topics
in Extreme Values (M. Ahsanullah and S. N. U. A. Kirmani, eds.) 185-207. Nova
Science Publishers, New York.

Segers, J. (2010). Weak convergence of empirical copula processes under nonrestrictive
smoothness assumptions. Available at arXiv:1012.2133v1.

Sklar, M. (1959). Fonctions de répartition à n dimensions et leurs marges. Publ. Inst.
Statist. Univ. Paris 8 229-231. MR0125600

Strimmer, K. (2009). fdrtool: Estimation and control of (local) false discovery rates.
R package Version 1.2.6.




Tawn, J. A. (1988). Bivariate extreme value theory: Models and estimation. Biometrika
75 397-415. MR0967580

Tsukahara, H. (2005). Semiparametric estimation in copula models. Canad. J. Statist.
33 357-375. MR2193980

Van der Vaart, A. W. and Wellner, J. A. (1996). Weak Convergence and Empirical
Processes. Springer, New York. MR1385671

Wang, J. L. (1986). Asymptotically minimax estimators for distributions with increasing
failure rate. Ann. Statist. 14 1113-1131. MR0856809

Zhang, D., Wells, M. T. and Peng, L. (2008). Nonparametric estimation of the dependence
function for a multivariate extreme value distribution. J. Multivariate Anal.
99 577-588. MR2406072



Fakultät für Mathematik
Ruhr-Universität Bochum
Universitätsstrasse 150
44780 Bochum
Germany

E-mail: axel.buecher@ruhr-uni-bochum.de
holger.dette@ruhr-uni-bochum.de
stanislav.volgushev@ruhr-uni-bochum.de



